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Improved cytokine design 



The present invention relates to novel methods for the design of proteins, in particular, cytokines. 
These mefliods allow the stabilisation of such cytokines, as well as modification of their 
selectivity/specificity for their cognate receptors. The invention also relates to various modified 
5 proteins that have been designed by the methods of the invention. 

Cytokines are a family of growth factors secreted primarily from leukocytes, and are messenger 
proteins that act as potent regulators capable of effecting cellular processes at sub-nanomolar 
concentrations. Their size allows cytokines to be quickly transported around the body and degraded 
when required. Their role in controlling a wide range of cellular functions, especially the immune 

10 response and cell growth has been revealed by extensive research over the last twenty years 
(Boppana, S.B (1996) Indian. J. Pediatr. 63(4):447-52). These roles include immune response 
regulation (Nishihira, J. (1998) Int. J. MoL Med. 2(1): 17-28), inflammation (Kim, P.K. et al (2000) 
Surg. Clin. North. Am. 80(3):885-894), wound healing (Clark, R,A. (1991) J. Cell Biochem. 
46(1): 1-2), embryogenesis and development, and apoptosis (Flad, H.D. et al (1999) Pathobiology. 

15 67(5-6):291-293). 

Clinical use of cytokines to date has focused on their role as regulators of the immune system 
(Rodriguez, F.H. et al (2000) Curr. Pharm. Des. 6(6):665-680) for instance in promoting a response 
against thyroid cancer (Schmutzler, C. et al (2000) 143(1):15"24). Their control of cell growth and 
differentiation has also made cytokines anti-cancer targets (Lazar-Molnar, E. et al (2000) Cytokine. 
20 12(6):547-554; Gado, K. (2000) 24(4): 195-209). Novel mutations in cytokines and cytokuie 
receptors have been shown to confer disease resistance in some cases (van Deventer, S.J. et al 
(2000) Intensive Care Med. 26(Suppl 1):S98:S102). The creation of synthetic cytokines (muteins) 
in order to modulate activity and remove potential side effects has also been an important avenue of 
research (Shanafelt, A3, et al (1998) 95(16):9454-9458). 

25 Cytokine molecules have thus been shown to play a role in diverse physiological functions, many 
of which play a role in disease processes. Alteration of their activity is a means to alter the disease 
phenotype and as such, the identification of novel cytokine molecules is of significant scientific 
interest. 

Of particular interest are ligands that belong to the Tumor Necrosis Factor ligand (TNF) family; 
30 these proteins are involved in a wide range of biological activities, ranging from cell proliferation 
to apoptosis. 

Members of the TNF ligand family induce signallmg pathways that lead to apoptosis or 
programmed cell death (PCD) through interaction with their cognate receptors. Ligand-bound 
receptors transmit the signal across the membrane by bringing their cytoplasmic portions into close 



proximity, leading to the recruitment and activation of downstream effector proteins. Apoptosis is a 
process fundamental to the normal development and homeostasis of multicellular organisms. 
However, the impairment of apoptosis regulation is implicated in the pathogenesis of cancer and 
several chronic diseases, including acquired immunodeficiency syndrome (autoimmune disease and 
5 AIDS) and neurodegenerative disorders (eg Parkinsons). Common examples are chronic transplant 
dysfunction, rheumatoid arthritis, chronic obstructive pulmonary disease (COPD) and asthma. 
Molecules that mediate reversal of imbalance in signal transduction could be effective therapeutics 
in diseases. 

Members of the TNF ligand family are also master conductors of immune function and immune 
10 tolerance. There is a complex balance between immunostimulatory and immunoregulatory 
functions within this family that ensures appropriate immune responses. Genetic polymorphisms in 
the TNF ligand receptor family can result in deregulation of immune homeostasis. Such 
deregulation can lead to pathogenesis. 

The TNF family of ligands interact with their cognate receptors to trigger several signalling 
1 5 pathways that play key roles in regulatory and deleterious effects on immune tolerance^, in addition 
to both protective and pathogenic effects on tissues (Rieux-Laucat et a/., 2003, Current Opinion in 
Immunology 15:325; Mackay and Ambrose, 2003, Cytokine and growth factor reviews, 14: 311; 
Mackay and Railed, 2002, Current opinion in Immunology, 14: 783-790). Examples of such 
proteins include ligands such as RANKL, TRAIL, and APRIL, which are implicated in disease 
20 conditions such as rheumatoid arthritis, autoimmune diabetes, systemic lupus etythematosus (SLE), 
Sjorgen's syndrome, experimental autoimmune encephalomyelitis (EAE), inflammatory bowel 
disease (IBD), autoimmune lymphoproliferative syndrome (ALPS) and multiple sclerosis. 

All monomeric subunits of TNF ligand family members consist of antiparallel P-sheets, organised 
in a jellyroU topology, and these subunits self associate in bell-shaped homotrimers, the bioactive 

25 form of the ligand. Sequence homology is highest between the (aromatic) residues responsible for 
trimer formation. A trimer binds three subunits of a cognate receptor, each receptor subunit binding 
in the grooves between two adjacent monomer subunits. The ligands are type II transmembrane 
proteins, but the extracellular domain of some members can be proteolytically cleaved from the cell 
surface, yielding a bioactive soluble form of the ligand. Recent reviews of the TNF ligand family 

30 are readily available (Locksley et al. Cell 104, 487-501 (2001); Bodmer et aL, Trends Biochem. 
Sci. 27, 19-26 (2002)). 

Knowledge of members of the TNF ligand family can also be applied to other signalling pathways 
triggered by ligand-receptor interaction within the same and olher families. 

Although naturally-occurring cytokines are of significant interest to the scientific community, the 
35 properties of many of these molecules do not necessarily suit their application in a clinical setting. 




For example, the stability of a cytokine is important throughout the production process and for tnb 
shelf-life of the final product, as well as influencing the pharmacokinetic and -dynamic properties 
of the protein therapeutic (Marshall et al. Drug Discov. Today 8, 212-221 (2003)). Several 
strategies are currently used to augment the thermal stability of proteins (Fersht^A. & Winter,G. 
5 Protein engineering. Trends Biochem. Set 17, 292-295 (1992); Van den Burg et al, Curr, Opin. 
Biotechnol 13, 333-337 (2002)). Both rational (Pantoliano et ah. Biochemistry 26, 2077-2082 
(1987); Van den Burg et ah, Proc. Natl Acad. Sci. U. S. A 95, 2056-2060 (1998); Villegas et al. 
Fold. Des 1, 29-34 (1996)) and directed evolution methods (Giver et aL, Proc. Natl Acad. Sci. U. 
S, A 95, 12809-12813 (1998); Jung et ah, J> Mol Biol 294, 163-180 (1999)) have been 
1 0 successfully used to improve stability. A disadvantage of a rational approach is that one can design 
only a limited number of potentially improved variants. In contrast, directed evolution methods 
allow large numbers of variants to be generated and screened. However, suitable 
selection/screening procedures are required, which are often not available or are very labour 
intensive. 

15 More recently, computational redesign algorithms have been employed to enhance stability, 
amongst other properties, of proteins (DeGrado et ah, Annu. Rev. Biochem. 68, 779-819 (1999)). 
These methods combine computer design steps with in silico screening, permitting screening of a 
much larger sequence space than is experimentally possible with high-throughput techniques. 
Efficient algorithms are needed to search the vast sequence space and accurate scoring functions 

20 are required in order to rank the best designs (Dantas et ah, J. Mol. Biol. 332, 449-460 (2003)). 
Some limited success has been achieved for certain proteins - for example, computational redesign 
has recently been used to generate a hyper-thermophilic variant of streptococcal Gpl domain 
protein (Malakauskas, S.M. & Mayo, S.L. Nat. Struct. Biol. 5, 470-475 (1998)), to enhance the 
stability of the spectrin SH3 domain (Ventura et ah, Nat. Struct. Biol 9, 485-493 (2002)) and to 

25 improve the (thermal) stability of the therapeutically interesting four helix bundle cytokines, 
granulocytecolony stimulation factor (G-CSF) and human growth hormone (hGH). However, the 
various shortcomings of these methods do not allow their widespread application. 

It would also be of great use were it possible to alter the selectivity/specificify of cytokines for their 
receptors. For example, certain members of the TNF ligand family bind more than one receptor or 

30 bind decoy receptors which lack or have truncated intracellular domains. A specific example is the 
tumor necrosis factor-related apoptosis inducing ligand (TRAIL; TNFSFIO) Wiley et al. Immunity. 
3, 673-682 (1995); Pitti et ah, ^ Biol Chem. 271, 12687-12690 (1996)), which in its soluble form 
binds its receptors DR4 (TRAIL-Rl) and DR5 (TRAIL-R2), in addition to its decoy receptors, 
DcRl (TRAIL-R3), DcR2 (TRAIL-R4) and OPG. Receptors DR4 (TRABL-Rl) and DR5 (TRAIL- 

35 R2) contain the cytoplasmic Fas associated death domain (FADD), and binding of TRAIL to these 
receptors induces apoptosis. TRAIL also appears to be able to induce the proliferative NF-kB 
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pathway through Tumor necrosis factor receptor associated death domain (TRADD). Having 
selective inducers of DR4 (TRAIL-Rl) and DR5 (TRAIL-R2) signalling is likely to be of great 
interest, due to the presumably different cross-linking requirements of both death receptors. 
Depending on the cross-linking, the signalling pathway could induce the proliferative or the apoptic 
5 pathway. DcRl (TRAIL-R3) and DcR2 CrRAIL-R4) do not contain a death domain or contain a 
truncated death domain, respectively. Binding to these receptors does not induce apoptosis; on the 
contrary, it may actually prevent apoptosis by sequestering available TRAIL. DcR2 (TRAIL-R4) 
however appears also be capable of inducing the NF-kB pathway. Unlike other apoptosis inducing 
TNF family members, TRAIL appears to be mactive against normal healthy tissue, therefore 
1 0 attracting great interest as a potential cancer therapeutic (Ashkenazi et al., J. Clin. Invest 104, 155- 
162 (1999)). A recent significant publication has however shown that TRAIL-R3 is upregulated by 
p53 in bi«ast tumour cell through use of the genotoxic drug, doxorubicin (Ruiz de Almod6var et 
al. 2003, Nov 17, Manuscript MS 11243200). This implies that efficacy of wild-type TRAIL may 
be diminished in anti-tumour therapy since it also binds the decoy receptors (that do not initiate 
1 5 apoptosis). Therefore, variants of TRAIL, that have altered selectivity/specificity could be direct to 
the pio-apoptotic receptors, DR4 (TRAIL-Rl) or DR5 (TRAIL-R2) and would have ultimately 
improved application in cancer treatment. 

The TNF ligand family members, such as TRAIL, APRIL, RANKL BAFF, LIGHT, FasL and 
TNF-a all bind more than one receptor. At least six death-domain-contaming receptors have now 
20 been identified — Fas, TNF receptor 1 (TNF-Rl), death receptor 3 (DR3; also known as TRAMP, 
Wsl, APO-3 and LARD), the two receptors for TNF-related apoptosis-mducing ligand (TRAIL) 
TRAIL-R1/DR4 and TRAIL-R2ADR5, and DR6. Certain member of the TNF Ugand famUy also 
bind decoy receptors which lack or have truncated death domams, such as, TRAIL, which binds its 
decoy receptors, DcRl, DcR2 and OPG. The accumulation of recent knowledge in this area, 
25 therefore, opens new avenues for therapeutic design. In this respect, selectivity of novel molecules 
is of primary unportance to discern the specific role of the activation of different receptors and 
therefore the functional effects of Ugand binding to several receptors, and the concomitant 
mfluence on the pathogenesis of the associated diseases related to signal activation. 
Rational design permits only a relative small amount of variants to be designed. For molecular 
30 evolution/high throughput screening (HTS) methods, selection and screening methods have to be 
developed and large libraries of variants have to be screened. Especially for enhancement of 
stability there are relatively few examples of successful selection or screening methods. 
Computational design methods known in the art have only been applied to improve the stability of 
relatively small monomeric molecules, not to improve the stability of larger multimeric molecules. 
35 Often these designs are focused on changmg amino acids in the core of the molecule. Changing 




residues in the core (repacking the core) does stabilise molecules to some extent, but can lead to^ 
molten globule state. 

Computational design methods have also been used in the art to alter selectivity, but again only for 
relatively small monomeric molecules, not to improve the selectivity for binding partners of larger 
5 multimeric molecules. Furthermore, crystal structure is currently indispensable. 

One aspect of the present invention uses a combination of computational redesign algorithms and 
educated manual input to design proteins that are more stable than their wild-type counterparts. 

A second aspect of the invention relates to the redesign of proteins to alter their selectivity for 
receptor. 

1 0 Summary of the invention 

According to a first aspect of the invention, there is provided a computer-implemented method for 
the stabilisation of a p sheet multimeric cytokine, comprising the step of: 

mutating a residue in a monomer component of the multimeric cytokine protein so as to improve 
the free energy of the monomer or of the multimeric complex relative to the wild-type unmutated 
15 monomer component, wherein said mutated residue is non-conserved between homologous 
members of the cytokine family. 

Variants of p sheet multimeric cytokines with enhanced stability have a number of advantages, 
including increased in vivo and in vitro half-lives, increased yields generated during protein 
expression, greater stability during purification and an extended shelf-life compared to their wild- 

20 type counterparts. Stable variants of these proteins can thus be used as protein therapeutics or 
diagnostics. The proteins have a relatively close resemblance to the wild-type structure and this 
reduce the risk of immunogenicity, particularly when compared to variants stabilised by fusion 
tags, one currently favoured method of stabilising proteins. They also have advantages when 
compared to agonistic or antagonistic antibodies. In contrast to antibodies, variants can be 

25 produced in Escherichia coli and the mode of signalling more closely resembles that used by the 
wild-type cytokine protein. 

The term "multimeric cytokine" as used herein is meant to include all p sheet multimeric cytokines. 
Examples of such cytokines are presented in Table 6. Other examples will be known to those of 
skill in the art. A recent review on structure of TNF ligand family is available (Bodmer et al, 2002. 
30 Trends Biochem. Sci. 27, 19). 

One feature of P sheet multimeric cytokines is that they are composed of identical monomeric 
subunits or of different monomeric subunits. Methodologies could be applied to all cytokine 
protein families and more specifically to the members of the TNF ligand-receptor family. Other 
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examples of families of proteins embraced by the superfamily of cytokines include those classed as 
Beta-Trefoil, Beta-sandwich, EGF-like, and Cystine knot cytokines). 

Of particular interest for the application of the methodology of the invention are the p sheet 
multimeric cytokines that are members of the tumor necrosis factor ligand family. Ligands 
5 belonging to this family are involved in a wide range of biological activities, ranging from cell 
proliferation to apoptosis, and they share similar structural characteristics. All monomeric subunits 
of these ligands consist of antiparallel p-sheets, organized in a jelly-roll topology, and these 
subunits self associate in bell-shaped homotrimers, the bioactive form of the ligand. A trimer binds 
three subunits of a cognate receptor, each receptor subimit binding in the grooves between two 
1 0 adjacent monomer subunits. The ligands are type 11 transmembrane proteins, but tihe extracellular 
domain of some members can be proteolj^cally cleaved from the cell surface, yielding a bioactive 
soluble form of the ligand. 

It has been found advantageous to use alignment information in order to focus the design on non- 
conserved residue positions. This method of protein stabilisation focuses on these non-conserved 
1 5 residues on the premise that conserved residues are usually retained in a family for a good reason 
and it is probable that any mutation of a conserved residue will decrease protein stability. On the 
other hand, regions with high sequence variability are tolerant to mutation and it can be expected 
that variants that stabilize the protein can be found in these regions. There is less evolutionary 
pressure for these residues to have been retained among the family members. 

20 The combined approach of the method therefore employs family alignment information and a 
computational design algorithm. This reduces the sequence space search for every position in the 
protein being studied and decreases the computing time and power necessary for the methodology. 

Identification of non-conserved residues can be done using any one of a number of systems known 
to the person of skill in the art. Such an analysis can be done by eye, but is more easily achieved 

25 using a computer-implemented alignment algorithm, such as BLAST (Altschul et al (1990) J Mol 
Biol., 215(3): 403-10), FASTA (Pearson & Lipman, (1988) Proc Natl Acad Sci USA; 85(8): 2444- 
8) and, more preferably, PSI-BLAST (Altschul et al (1997) Nucleic Acids Res., 25(17): 3389- 
402), ClustalW (Thompson et al, 1994, NAR, 22(22), 4673-4680) or the like. Assessment of 
whether or not a residue is conserved will be clear to the skilled reader and will depend on the 

30 number of related proteins that are aligned and the degree of relatedness amongst them. For 
example, if only two family members are aligned and these proteins share 50% identity, then the 
conserved residues are those that are shared between the two proteins at the same positions. On the 
other hand, if 20 proteins in the same family are aligned, it is most unlikely that the least alike of 
these proteins will possess homology as hi^ as this. Preferably, then, m an aligimient between the 

35 candidate for mutation and representative members of llie protein family, a conserved residue is 
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one that is shared between at least 20 % of the family, preferably at least 30 %, preferably at lea 
40 %, preferably at least 50 %, and may be at least 60 %, preferably 70 % or more. For example, 
sequence homology in the Tmnor Necrosis Factor ligand family is highest between the (aromatic) 
residues that are responsible for trimer formation; these residues are thus unsuitable candidates for 
5 mutation according to the methodology of the invention. 

Once non-conserved residues are identified, the next step in the method requires an evaluation of 
which of these residues are candidates for mutation. 

Preferred aspects of the methodology mutate non-conserved residues that occupy positions at the 
surface of the monomer component in the multimeric cytokine protein structure. By doing this, the 
1 0 multimeric structure is stabilised as a whole. 

As used herein, the term "at the surface" means that tiie residue concerned in the monomer remains 
surface-exposed in the multimer complex. Such residues ai^ solvent-exposed and thus hydrophilic 
m nature. Of course, surface-exposed residues will be present not only at the surface of each 
monomer, but will also be surface-exposed in the multimer complex. Knowledge of the position of 
15 a particular residue in the structure of a protein may come from knowledge of the structure itself, or 
may be derived by extrapolation from the position of the equivalent residue in the structure of a 
protein in the same family. 

Another preferred aspect of the methodology is to mutate non-conserved residues near positions 
close to the interface between two monomer components of the multimeric cytokine protein 
20 structure. This has the effect of stabilising the multimeric structure of the protein through 
stabilisation of the inter-chain interfaces. 

As used herein, the term "near positions close to the mterface between two monomer components" 
means that the residue concerned in the monomer is close to or at the interface formed when two 
monomer components of a multimeric protein complex together. The residue must be near enough 

25 to this interface for its constituent atoms to influence monomer-monomer interactions, preferably in 
a positive way. For hydrophobic interactions the distance may be as close as the Van der Waals' 
radius of subject atoms. For hydrogen bonding the distance may be from 2.7 angstrom to 3.1 
angstrom, for electrostatic interaction the distance may be from 1.4 angstrom up to 12 angstrom. 
Such influence may be effected through, for example, polar or hydrophobic solvation energies, van 

30 der Waals' interactions, H-bond energies, electrostatics, or backbone and side chain entropies. 

For trimeric proteins, one preferred aspect of the methodology is to mutate residues that occupy 
positions along the central trimeric axis in the multimeric cytokine protein structure. This has the 
effect of stabilising the trimer. 
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As used herein, the term "residues along the central trimeric axis" means that iJie residue concerned 
in the monomer is close to or at the interface formed when three monomer components of a 
trimeric protein complex together. As described above, the residue must be near enough to this 
interface for its constituent atoms to influence the confluence of tixe three monomer components 
5 into a trimeric complex. 

Most preferably, a method according to the invention mutates residues in more than one of the 
classes referred to above, preferably in at least two of the classes and even more preferably in all 
three of these classes. 

The methodology described here is. to the inventors' knowledge, the first time that a technique 
1 0 incorporatmg computational engmeering has been applied to redesign a large (>100 amino acids) 
all-|3-sheet protein towards a more thermally stable variant. 

Until recently, lack of protein structural information in relation to multfaneric p sheet cytokines and 
their receptors made intervention on the level of signal transduction initiation (ligand-receptor 
interaction) unfeasible. Detailed crystal structural information is now available for many of these 
15 cytokines, together with reliable homology models. Therefore, studies on protein-protem 
interaction and the elucidation of mechanisms of ligand-receptor interaction and activation ar« now 
possible. For example, the following TNF ligand family members have been crystallised, either in 
complexed or micomplexed forms: Human BAFF. Blys (Liu Y. et al., 2002 Cell 108(3):3 83-94; 
Oren DA. et al, 2002 Nat Struct Biol.. 9(4):288-92.; Karpusas M. et al, 2002 J Mol Biol. 
20 315(5):1145-54.); human CD40L (Karpusas M. et al., 2001. Structure (Camb). Apr 4;9(4):321-9.); 
murine RANKIVTRANCE (Lam J. aU, 2001 J Clin Invest. 108(7):971-9..); human TNF-D 
(Bamier DW. et al., 1993 Cell. 1993 May 7;73(3):431-45.; Eck MJ. et al., 1992 J Biol Chem., 
267(4) 21 19-22.) human TRAIL (Mongkolsapaya J et al, 1999 Nat Struct Biol. 6(11): 1048-53., 
Cha SS. et ah, 1999. 2000 Immunity. 1999 Aug;ll(2):253-61. 2000 J Biol Chem. 2000 Oct 
25 6;275(40):31 171-7.; Hymowitz SG. et al, 1999 Mol Cell. 4(4):563-71), human TNF-D (Reed C. et 
al, 1997 Protein Eng. 10(10):1101-7; Cha SS. et al., 1998 J Biol Chem. 1998 Jan 23;273(4):2153- 
60; Naismith JH. et al., 1996 Structure. 1996 Nov 15;4(ll):1251-62., Naismith JH. et al., 1995. J 
Biol Chem. 1995 Jun 2;270(22):13303-7. 1996 J Mol Recognit. 1996 Mar-Apr;9(2):l 13-7.; Carter 
PC. et al., 2001 ProcNatl Acad Sci U S A, ;98(21): 11879-84. Erratum in: ProcNatl Acad Sci U S 
30 A 2001 Dec 18;98(26): 15393.). Therefore, the amino acids that make up domains representing 
protein-protein interaction motifs between these ligands and then- respective receptors are now 
known. Such interacting domains in the TNF family have an intrinsic propensity to initiate 
signalling pathways associated with the modulation of diseases such as cancer and chronic diseases 
such as autoimmune disease, and are starting points for drug design. 




Visualisation of the structure of a candidate cytokine protein may be performed computationally 
using one or other of the many systems available for this task. Such systems are generally designed 
to import data describing a protein structure (such as a structure from the Protein Data Bank, the 
PDB) and convert this to a three-dimensional image. At present, the largest public depository of 
5 information relating to protein structure is the PDB database ( http://www.rcsb.org/pdb \ that now 
includes over 23,000 protein and nucleic acid structures, elucidated using methods of x-ray 
crystallography and nuclear magnetic resonance. Images of protein structure allow intimate 
analysis of the structure of a protein to evaluate the positions of each residue in the protein 
structure, and an evaluation of which residues participate in interactions with other moieties, such 
10 as a receptor or monomer partner. 

For example, in the example described herein, the structure of the TRAIL protein is visualised 
using the template PDB structure 1DU3 (Cha et a/., J, Biol Chem. 275, 3 1171-3 1 177 (2000)). The 
crystal structure at 2.2 A resolution contains the trimeric structure of human TRAIL in complex 
with the ectodomain of the DR5 (TRAIL-R2) receptor. In this case, the TRAIL monomer lacks an 
15 external, flexible loop (130-146), not involved in receptor binding or in monomer-monomer 
interaction. Accordingly, to complete the molecule, this loop was modelled using the structure of 
1D4V (2.2 A) (Mongkolsapaya et aL, Nat. Struct. Biol. 6, 1048-1053 (1999)), a monomeric TRAIL 
in complex with DR5 (TRAIL-R2) receptor, having the atomic coordinates of the loop. Finally, the 
TRAIL molecule was isolated by removing the receptor molecules from the PDB file. 

20 Already, there are computer-implemented programs that allow the prediction of protein structure 
ab initio, or by inference from closely-related proteins of known structure. Accordingly, for the 
method of the invention, it is not strictly necessary for the structure of a candidate protein to be 
known. A significant amount of information can be gleaned by analogy from structures of related 
proteins; for example, TNF ligand family members show similar trimeric structures. 

25 For example, for some p sheet multimeric cytokines, such as APRIL, there is no available structure 
of the complex with the receptor. However, there is generally structural information available for 
homologous ligands and receptors, which allows the complexes to be built by Homology 
Modelling. This is particularly true in those cases in which the sequence homology is higher then 
40 % and insertions or deletions are not found in the binding region of ligand and receptor. 

30 Visualisation of the isolated monomers, monomer-monomer interface and central core of the 
candidate protein will show the residues that are potential candidates for mutagenesis. 

In the case of design for stability mutants, in order to filter out unsuitable residues for mutagenesis, 
any highly conserved hydrophobic residues should be discarded from the list of potential 
candidates for mutagenesis. In addition, residues involved in receptor binding should be discarded 
35 in the case of design for stability mutants. These residues cannot be mutated without disrupting 
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interactions with the receptor. The sequence space search for every position may preferably be 
simplified, by checking the naturally occurring amino acids in a multiple sequence alignment of 
proteins belonging to the family of interest, thus decreasing the computing time, and subsequently 
focusing on non-conserved residues. 
5 Preferably, in conjunction with a visualisation tool, a protein design algorithm is used to facilitate 
the identification of candidate residues for mutation. Examples of suitable algorithm include the 
"WHATIF" program (Vriend, (1990), J Mol Graph 8(1), 52-6, 29) or more sophisticated programs 
such as the algorithm "PERLA" (protein engineering rotamer library algorithm) (Fisinger S, 
Serrano L, Lacroix E. Protein Sci. 2001 Apr; 10(4): 809- 18). The latter, based on a rotamer library 
1 0 search, allows a combinatorial exploration at different positions simultaneously in the protein, and 
identifies the optimal sequence that improves a structural property of the protein (such as its 
stability). A detailed description of this algorithm is available elsewhere (Lacroix,E. Protein design: 
a computer based approach, Ph.D.thesis. (U. Libre de Bruxelles,1999)) 
(http://ProteinDesign.EMBL-Heidelberg.DE) and its use has been previously described (Ventura et 
15 «/., Nat. Struct. Biol. 9, 485-493 (2002); Fisinger et al. Protein Sci. 10, 809-818 (2001); Lopez et 
al, J. Mol. Biol. 312, 229-246 (2001); Reina et ah, Nat. Struct. Biol. 9, 621-627 (2002)). Other 
suitable algorithms mclude 3D Jigsaw and EasyPred. 

Briefly, the PERLA algorithm performs strict inverse folding: a fixed backbone structure is 
decorated with amino acid side chains from a rotamer library. Relaxation of strain in the protein 
20 structure is achieved via the generation of subrotamers. Most terms of the scoring function are 
balanced with respect to a reference state, to simulate the denatured protein. The side chain 
conformers are all weighted using the mean-field theory and finally candidate sequences with 
modelled structures (PDB coordinates) are produced. 

In the case of a multimeric protein such as the TNF family ligand TRAIL, protein design with 
25 PERLA requires the following steps. 

Firstly, residues of a monomer that could establish specific interactions with the contiguous 
monomer must be identified and selected as described above. 

Secondly, side chains that contact the residues that are candidates for mutation must be identified to 
allow side chain movements that are necessary to accommodate the new residues introduced by the 
30 algorithm. PERLA automatically selects these residues based on a geometrical approach that takes 
Ca-Cci distances and the angle between Ca-Cfi vectors into consideration. 

Thirdly, the algoriftm places the amino acid repertoire at each position selected from a set of 
naturally occurring amino acids m a multiple sequence aligmnent of the TNF Ugand family, and 
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eliminates from consideration those side-chain conformations and amino acids that are n^ 
compatible with the rest of the structure. 

Fourthly, all possible pair-wise interactions are explored to eliminate those combinations that are 
less favourable. This energy evaluation is preferably carried computationally, for example using a 
5 force field algorithm such as the program FOLD-X (Guerois et aL, J, MoL Biol 320, 369-387 
(2002)) or a modified version (Schymkowitz, J., Borg, J., Rousseau, F. & Serrano, L, "manuscript 
in preparation") of this program, available at (http://fold-x.embl-heidelberg.de). The force field 
module of FOLD-X evaluates the properties of the structure, such as its atomic contact map, the 
accessibility of its atoms and residues and the backbone dihedral angles, in addition to the H-bond 
1 0 network and electrostatic network of the protein. The contribution of water molecules making two 
or more H-bonds with the protein is also preferably taken into account. FOLD-X then proceeds to 
calculate all force field components: polar and hydrophobic solvation energies, van der Waals' 
interactions, van der Waals' clashes, H-bond energies, electrostatics, and backbone and side chain 
entropies. 

1 5 Finally, an output of sequences and PDB coordinates corresponding to the best calculated solution 
(in terms of energy) is produced and may be ranked in terms of free energy, for instance, using 
FOLD-X. 

The resultant data files (preferably PDB files or similar) containing the mutations should then be 
energy-minimized. One way of doing this is by using a program such as GROMOS 43B1 as 
20 unplemented in Swiss-PdbViewer v3.7b2 (Guex & Peitsch; Electrophoresis 18, 2714-2723 
(1997)), and evaluated by FOLD-X (http://fold-x.embl-heidelberg.de). The final energies of the 
models are then compared to the reference, wild-type structure and expressed as AAG (kcal mol"^). 

Favourable mutations may of course be combined and evaluated in terms of free energy (kcal mol- 
1). Unfavourable combinations (e.g. high Van der Waals' clashes) should be eliminated. 

25 If necessary, outputs of sequences and co-ordinates may subsequent to the design process be 
reintroduced in the design algorithm for a further roimd or rounds of design. 2nd, 3^*^, 4*, 5*** or 
more rounds of design may be used. 

The above methodology facilitates the calculation of free energy, which must be improved by 
mutation of the monomer, relative to the free energy of wild-type unmutated monomer. By "free 
30 energy" is meant the free energy of folding. By "free energy of folding" is meant the diflFerence in 
Gibbs energy (including enthalpic and entropic terms) between the protein in a folded or partially 
folded state and the protein in its fully denatured state. In calculating the free energy of folding, the 
calculation should take into account factors such as the accessibility of atoms, the existence of 
hydrogen bonds and the existence of electrostatic charges between atoms that are predicted to occur 
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in the folded structure, the van der Waals' interactions, the solvation, the main chain and side chain 
entropic effects being also taken into account. These atomic energetic calculations are then 
summed. This calculation should thus ideally take account of the nature of the stabilising 
interactions that compete with or favour the topological constraints that are inherent in a particular 
5 protein folding pathway, taking sequence considerations into account when calculating the main 
chain, the side chain and the loop entropic costs and the favourable contributions to protein 
stability. Such a method thus should incorporate detailed energetic functions that effectively 
estimate the balance between topological constraints (entropic origin) on the one hand and 
interactions stabilising a fold, on the other. 
1 0 The free energy of a particular protein may be assessed using any suitable method, as will be clear 
to the skilled reader. A number of suitable computer programs exist for the automated calculation 
of free energy; one preferred program is the FOLD-X program (Guerois R, Nielsen JE, Serrano L.. 
J Mol Biol. 2002 Jul 5;320(2):369-87) which uses optimal energy functions 1» rank sequences 
according to their fitness for a given fold. 
1 5 Such molecules identified herein specifically interfere at the ligand receptor family interface where 
apoptosis or autoimmune signalling pathways are triggered. A combined methodology that utilises 
Ae design approach outlined above in conjunction with such experimental techniques, is mcluded 
as an aspect of the present invention. 

Molecules generated using the above methods may also be used to elucidate the mechanism of 
20 action of P sheet multimeric cytokines. For example, although the crystal structures of TNF family 
members are known, little is known of the exact mechanism of binding and signal initiation by the 
ligand-ieceptor complex. Several TNF ligand femily members, such as TRAIL, APRIL and 
RANKL. bind more than one receptor type which dependmg on receptor type may or may not 
trigger signal transduction pathways. Many questions therefore still exist with respect to molecular 
25 regulation of diseases such as cancer or autounmune disease on the level of ligand-receptor 
complex formation and subsequent initiation of signal transduction. In vitro and in vivo studies 
aimed at the characterisation of this complex should add to a better understanding of the underlying 
(patho) physiological response and will aid in creating unique lead molecules. Use of these lead 
compounds will facilitate the elucidation of more complex basic questions in relation to protein- 
30 protein mteraction, signal transduction pathways and bioactivity m in vitro and in vivo situations. 
In particular, protem or peptide mimetics generated may act as receptor agonists, antagonists, 
which may be engineered to have mcreased or decreased structural stability, receptor binding 
selectivity and/or bioactivity. In particular, such compounds have utility in the regulation of 
apoptosis. Members of the TNF ligand family induce signallmg pathways that lead to apoptosis or 
35 programmed cell deatii (PCD) through interaction with then- cognate receptors. Ligand-bound 




receptors transmit the signal across the membrane by bringing their cytoplasmic portions into clos^ 
proximity, leading to the recruitment and activation of downstream effector proteins. Apoptosis, 
the mechanism whereby multicellular organisms dispose of superfluous or damaged cells in a 
controlled manner, is a process fundamental to the normal development and homeostasis of 
5 multicellular organisms. However, the impairment of apoptosis regulation is implicated in the 
pathogenesis of cancer and several chronic diseases, including acquired immunodeficiency 
syndrome (autoimmune disease and AIDS) and neurodegenerative disorders (eg Parkinsons). 
Common examples are chronic transplant dysfunction, rheumatoid arthritis, chronic obstructive 
pulmonary disease (COPD) and asthma. Molecules that mediate reversal of imbalance in signal 
1 0 transduction could be effective therapeutics in diseases. Cell induced apoptosis is mediated chiefly 
by members of the TNF ligand family that interact with cognate receptors to trigger apoptosis. 
Soluble portions of these cytokines or their receptors, or mimetics thereof, are thus attractive 
candidates to be used as therapeutics for a variety of diseases related to apoptosis impairment. 

In addition, a greater understanding of the role of TNF ligand family members may be achieved in 

15 controlling lymphocyte function, in order to identify novel targets for autoimmune therapy. 
Deregulated Activation-Induced Cell Death (AICD) may lead and contribute to autoimmunity. 
Impairment of AICD leads to accumulation of auto-reactive and chronically activated T cells. 
These cells can express various immune modulatory ligands, including APRIL and BAFF, which 
can alter B cell functions, causing autoantibody secretion and finally autoimmunity. The ligation of 

20 the TNF receptor family members may either lead to apoptosis through caspase-8/10 activation or, 
alternatively proinflammatory reactions, cell proliferation and differentiation through activation of 
NFkB. Activated T cells express a wide range of TNF ligand-receptor family members, all having 
different effects on lymphocyte fate. APRIL acts as a co-stimulator of T and B cells and enhances 
T cell survival in autoimmune disease. BAFF is essential for B cell Tl to T2 stage maturation, and 

25 thus, immunoglobulin secretion. RANKL initiates differentiation of osteoclast precursors that are 
responsible for bone desorption. In rheumatoid joints 40 % of the leukocj^es are T cells, mainly 
CD4-h. The proportion of B cells is only 1-5 %, although their contribution to chronic disease 
development is still great. Accumulation of these cells in inflamed joints leads to further 
lymphocyte activation and uncontrolled systemic immune responses. In RA, for example, one 

30 needs to target both hyper-plastic synovial cells and the immune cells accumulating in the joint 
capsule and also circulating in the body. 

TNF family members are important immune regulators through promotion of proliferation and by 
participating in AICD of peripheral T cells. Inhibition of endogenous TRAIL function leads to 
impaired AICD, proliferation of autoreactive lymphocytes and synovial cells resulting in arthritic 
35 inflammation and joint tissue destruction (Song K et al. J. Exp. Med., 2000; 191(7): 1095). APRIL, 
on the other hand can act as a co-stimulator of T cells and is able to prolong T cell survival. By 
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dissecting the molecular pathway of T cell activation and the cell death induced by reactivation we 
can understand the exact role of TNF ligand family members in autoimmunity. 

For example^ studies of AICD human peripheral T cells may be isolated from the blood of healthy 
individuals. T cells can be activated by anti-CD3 and anti-CD28 antibodies, or 
5 phytohaemagglutinin and maintained in the presence of various amounts of IL-2 and/or IL-15. 
AICD will then be induced at various days following activation by addition of anti-CD3 
monoclonal antibodies. The potential of various TNF family members to induce AICD of CD4+ or 
CD8+ T cell populations at various times can then be tested by addition of agonistic/antagonistic 
ligands such as those described herein; these will compete with signalling. 

10 Cell death in the CD4+ and CD84- population can be tested by, for example, the 7- 
aminoactinomycin method (Szondy Z et aL 1998, J.Infectious. Dis.l78:1288). In addition, the 
requirement for IL-2 in sensitising activated T cells to TRADL-R and Fas-mediated death will be 
examined. Since IL-I5 was shown to inhibit AICD, we will examine whether IL-15 interferes with 
TNF receptor family expression of activated T cells and thus with sensitisation to AICD (Marks- 

15 Konczalik J. et aL PNAS 2000 97(21):! 1445-11450). Based on these findings a functional assay 
can be suggested to test possible deficiencies in various autoimmune patients. We will attempt to 
understand the function of APRIL in modulating T-cell survival. Using APRIL as a co-activator, 
togetiher with anti-CD3, we will examine how it modulates TRAIL, FasL or TNF signalling, IL-2 
secretion and in this way the influence on T cell survival. If TNF ligand family members or 

20 variants are shown to have an influence we will proceed to characterise these molecules in several 
forms of autoimmunity. 

According to a further aspect of the invention, therefore, there is provided a P sheet multimeric 
cj^okine whose sequence has been altered so as to generate a more stable cytokine than the wild- 
type, unaltered cytokine protein. 

25 Multimeric cytokines included wilhin the terms of the invention are all p sheet multimeric 
cytokines, as described above. Examples are presented in Table 6. Preferred multimeric cytokines 
according to the invention are members of the TNF ligand family (see Bodmer et ah^ 2002. Trends 
Biochem. Sci. 27, 19). A preferred TNF ligand family member is the TRAIL protein. 

Preferably, such a p sheet multimeric cytokine is mutated in the soluble C-terminal portion of Ihe 
30 molecule. Examples of suitable residues for mutation are those at the following positions: 

a) a non-conserved residue at the surface of the monomer component of the multimeric 
cytokine (herein termed 'monomer' set); 

b) a non-conserved residue close to the interface between two of the monomer components of 
the multimeric cytokine (herein termed *dimer' set); 



c) for trimeric cytokines, a non-conserved residue along the central trimeric axis (her9 
termed 'trimer' set). 

This list is not exhaustive - various miscellaneous mutations may also be made dependent on the 
particular cytokine, that do not fall into any of the three categories above. The identification of non- 
5 conserved residues is described above. Similarly, identification of residues that fell into the above 
classes a) to c) is also described above. 

Preferably, a mutation in category a) falls in the external loop that connects that C and D anti- 
parallel beta strands of the cytokine (the CD loop), following the notation according to Eck (Eck et 
al, J. Biol. Chem. 267, 2119-2122 (1992)). Preferred examples of such mutations in the TRAIL 
10 protein include positions E194 and 1196. Equivalent mutations in other p sheet multimeric 
cytokines will be apparent to tiiose of skill in the art. Preferably, mutations introduced at these 
residues are E194I and/or I196S. 

In the TRAIL protein, when both these mutations are made, this has been found to result in a large 
improvement of free energy compared to wild-type TRAIL (AAG = -9.7 kcal mol"' monomer '). 
15 This high energy value is due to the fact that a trimer is being studied, in addition to the presence of 
significant van der Waals' clashes in the crystal structure (~5 kcal mol"' monomer'), which are 
removed upon mutation. Preferably, mutations ate made at both positions E194 and 1196; more 
preferably; both the mutations E194I and I196S are made. The predicted increase in stability of this 
double mutant (herein termed Ml) can be explamed smce Glu 194 is surrounded by hydrophobic 
20 groups (Trp 23 1, Phe 192, Ala 235) and the carboxyl group is uncompensated. Hie mutation Glu 
194 to He rectifies this situation by replacing the charged residue for a medium-sized hydrophobic 
residue. Conversely, He 196 is surrounded by polar residues (Asn 202, Lys 233) and is very close 
to the backbone, resulting in probable van der Waals' clashes. Mutation to Ser avoids clashes and 
allows formation of a hydrogen bond to Asn 202, located in the opposite part of the CD loop. Both 
25 mutations improve polar solvation energy, in addition to ameliorating side chain and backbone 
entropy. 

Preferably, a mutation in the dimer set may be made at one or more of the following positions: 125, 
163, 185, 187, 232, 234, 237, 203, 205, 239, 241, 271 and 274. In the TRAIL protein, the residues 
at these positions are the following: H125, F163, Y185, Q187, S232, D234, Y237, D203, Q205, 
30 L239, S241, E271 and F274. The skilled reader will be able to identify equivalent positions in other 

P sheet multimeric cytokine proteins, for example, by multiple alignment or by structural 

alignment. Preferred mutations in this class include mutations at D203, Q205 and Y237. 

Preferably, mutations introduced at these positions are one or more of D203I, Q205M and Y237F. 

More preferably, two or all three of these mutations are made. 
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A mutant TRAIL protein comprising these three mutations is herein termed M2. The design of M2 
leads to the creation of a hydrophobic cluster to stabilize the interaction between residues 203 and 
205 (D strand) of one monomer, and residue 237 (F strand) of the adjacent monomer. Gin 205 and 
Tyr 237 together form an intermolecular hydrogen bond, and Asp 203 points to a gap in the 
5 monomer-monomer interface. Mutation to He (203), Met (205) and Phe (237) breaks the Q205- 
Y237 hydrogen bond, but facilitates the tight packing of these residues, improving van der Waals' 
interactions, hydrophobic and polar solvation energies of the entire TRAIL molecule, without a 
further increase of van der Waals' clashes. 

Preferably, a mutation in the trimer set may be made at one or more of the following positions: 227, 
1 0 230 and 240. In the TRAIL protein, the residues at diese positions are the following: R227, C230 
and Y240. The skilled reader will be able to identify equivalent positions in other p sheet 
multimeric cytokine proteins. A preferred mutation in this class is R227M. Also the combination 
of C230S and Y240 is preferred, 

A mutant TRAIL protein comprising this mutation is herein termed M4. The Arg 227 residues of 
15 mutant M4 are located in strand E, equidistantly opposed in a central position along the 
longitudinal axis of the TRAIL trimer. The three arginines are surrounded by hydrophobic (He 
242), polar (Ser 241, Ser 225) and aromatic (Tyr 240, Tyr 243) residues. These tyrosines direct the 
hydroxyl groups away from Arg 227, Uius creating a rather hydrophobic cavity. The high 
concentration of positive charges is apparently not well compensated, since it forms only hydrogen 
20 bonds with the backbone (carbonyl groups of Ser 241). Thus, the mutation of these positions to Met 
could help to accommodate the hydrophobic environment, as well as to decrease the repulsion of 
monomers due to uncompensated positive charges. 

Preferably, a mutation in the miscellaneous set may be made at one or more of the following 
positions: 123, 272, 225, 280, 163, 123 and 208. In the TRAIL protein, the residues at these 
25 positions are the following: A123, A272, S225, V280, F163, A123 and V208. TTie skilled reader 
will be able to identify equivalent positions in other P sheet multimeric cytokine proteins. A 
preferred mutation in this class is S225A. 

A mutant TRAIL protein comprising this mutation is herein termed M3- Residue 225 of M3 
(S225A), is located in strand E and is solvent exposed in the monomeric form. However, after 
30 trimerization, this position becomes buried in a small pocket, leaving the side chain of the 
hydrogen bond donor Ser uncompensated. After mutation to Ala, the energy of the model is better 
than wild-type TRAIL for both polar and hydrophobic solvation energies, in addition to side chaia 
entropy. 

A further preferred mutant p sheet multimeric cytokine is one which incorporates a combination of 
35 the mutations described above, such as 2, 3, 4, 5, 6, 7, 8, 9, 10 or more such mutations. One 
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example of such a mutant combines mutations at positions 194, 196 and 225. In the TRAIL proteii^ 
the residues at these positions are E194, 1196 and S225. Preferably, the mutations introduced are 
El 941, I196S and S225A; a TRAIL mutant containing these three mutations has been engineered 
and is referred to herein as C 1 . 

The above mutations may be introduced in the full length cytokine sequence. Preferably, however, 
the above mutations are introduced into soluble forms of p sheet multimeric cytokines. For the 
TRAIL protein, a preferred soluble template into which these mutations may be introduced 
comprises amino acids 1 14-281 of the full length TRAIL protein. However, as the skilled reader 
will appreciate, variations in this template will very likely retain the properties of this soluble form 
and show biological activity if additional residues C terminal and/or N terminal of these boundaries 
in the polypeptide sequence are included. For example, an additional 1, 2, 3, 4, 5, 10, 20 or even 30 
or more amino acid residues from the wild-type cytokine sequence, or from a homologous 
sequence, may be included at either or both the C terminal and/or N terminal of these boundaries, 
without prejudicing the ability of the polypeptide fragment to fold correctly and exhibit biological 
activity. Similarly, truncated variants of this template in which one or a few amino acid residues 
(for example, 1, 2, 3, 4, 5, 10 or more) may be deleted at either or both the C terminus or the N 
terminus without prejudicing biological activity 

The methods described above have been applied to a prototypic example, TRAIL, for purposes of 
illustration. It will be appreciated that this example is intended as illustrative and is not limiting in 
any way. Novel mutants of TRAIL have been designed in order to increase the stability of the 
bioactive trimer. As will be evident from the Examples included herein, using this approach 
succeeded in extending the apparent thermal stability of the p-sheet protein by more than 5°C. This 
correlates with the preservation of overall structural characteristics as highlighted by the lasting 
bioactivity of these mutants as tested experimentally. For example, when measuring the residual 
bioactivity of wild-type TRAIL and TRAIL mutants upon incubation at 73°C for 1 hour, it was 
shown that, while wild-type TRAIL was all but thermally inactivated after --20 min, the TRAIL 
mutants Ml, M2, M3 and CI, significantly, had an improved stability. Although not tested herein, 
it has been shown that in case of certain therapeutically interesting proteins, improvement of 
thermal stability can also be indicative of an improved in vivo half-life (Luo et aL, Protein Set 1 1, 
1218-1226 (2002); Filikov et ah. Protein Sci. 11, 1452-1461 (2002)). Furthermore, the increase in 
thermal stability did not affect the biological activity of Ml, M3 and CI. Significantly, it is shown 
herein that stabilisation of the CD loop in a single monomer resulted in stabilisation of the entire 
trimeric molecule. 
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As stated above, it would be desirable were it to be possible to alter the selectivity/specificily of 
cytokines for their cognate receptors. The inventors have now achieved, for the first time, the 
alteration of the receptor binding selectivity/specificity of a large multimeric protein structure using 
computational redesign. Automated computer algorithms have been used in combination with 
5 hand-crafting and selection of pertinent residues to alter receptor binding selectivity of a 
multimeric all p-sheet protein, TRAIL. 

Accordingly, this aspect of the invention provides a method for the alteration of the selectivity of a 
p sheet multuneric cytokine for a target receptor, the method comprising 

identifying amino acids in the cytokine that are located in the receptor-binding interface as 
1 0 candidates for mutation; 

discarding residues interacting with amino acids that are conserved among receptors bound by the 
cytokine protein; 

discarduig residues mteracting with the receptor backbone; and 

substituting each of one or more residues in the cytokine protein for replacement residues that 
1 5 include amino acid side-chain conformations that are predicted to fit into the binding interface with 
the target receptor so as to provide an increase in binding affinity of the cytokine protein for that 
target receptor. 

As discussed above, alteration of selectivity for receptor is of significant interest in the cytokine 
field. For example, TNF ligand family members bind to receptors of the TNF receptor family, and 
20 upon binding an intracellular signalling cascade is activated. Different cell subtypes have different 
profiles of TNF receptor family expression. Many TNF ligand family members can signal through 
more than one type of TNF receptor family member proteins, resulting m different biological 
activities, depending on the receptor and the expression profiles of these receptors on the cell 
surface. For a protein therapeutic/diagnostic it may be advantageous to selectively activate (or 
25 inhibit) one of the receptors, for example to differentiate between a cell-proliferating activity and a 
cell-death mducing activity. Using the method of the invention described above, this is now 
possible even for large multimeric molecules. Furthermore, an improved selectivity/specificity 
would allow lower concentrations of a therapeutic variant to be administered than would be 
necessary with respect to wild-type cytokine. 
30 Such selective variants of cytokines are advantageous for use as protein therapeutics or diagnostics, 
since they exhibit a relatively close resemblance to the wUd-type structure and this reduces the risk 
of immunogenicity. They also have advantages when compared to agonistic or antagonistic 
antibodies. In contrast to antibodies, variants can be produced m Escherichia coli and the mode of 
signalling resembles the wild-type mode of signalling more closely. 




According to the method of this aspect of the invention, selectivity for receptor is of primary 
importance. Accordingly, affinity for a receptor may be slightly compromised for improvements in 
selectivity/specificity. 

Using a method such as that described herein, novel mutants of the TRAIL protein have been 
5 designed in order to shift selectivity/specificity towards its different membrane receptors (DR4 
(TRAIL-Rl), DR5 (TRAIL-R2), DcRl (TRAIL-R3) and DcR2 (TRAIL-R4)). As described above, 
having selective inducers of DR4 (TRAIL-Rl) and DR5 (TRAIL-R2) signalling is of considerable 
interest, due to the different cross-linking requirements of both death receptors. Depending on the 
cross-linking the signalling pathway could induce the proliferative or the apoptic pathway. 

1 0 Some residues important for binding and biological activity have been already identified in TRAIL 
by alanine-scanning mutagenesis (Hymov/itz et al.^ Biochemistry. 2000 Feb l;39(4):633-40), but in 
this study, the inventors have focused not only in the identification of critical residues for 
selectivity, also have suggested the best amino acid substitution at these positions to get a 
maximum effect in selectivity. We show that some residues are critical for receptor binding and 

1 5 selectivity; alanine-scanning mutagenesis could not identify these. The results also confirm that Ihe 
choice of amino acid chosen for replacement is important. 

This example acts as a prototypic example of how the method of the invention may be applied to a 
large multimeric p sheet protein. As the skilled reader will be aware, methods used in this study are 
also applicable to other multimeric cytokines, such as TNF family ligands. 

20 For example, a recent significant publication has shown that TRAIL-R3 is upregulated by p53 in 
breast tumour cell through use of the genotoxic drug, doxorubicin (Ruiz de Almodovar et ah, 2003, 
Nov 17, Manuscript M3 11243200). This implies that efficacy of wild-type TRAIL may be 
diminished in anti-tumour therapy since it also binds the decoy receptors (that do not initiate 
apoptosis). Therefore, variants of TRAIL, that have altered selectivity/specificity could be direct to 

25 the pro-apoptotic receptors, DR4(TRAIL-R1) or DR5 (TRAIL-R2) and would have ultimately 
improved application in cancer treatment. 

Methods for identifying amino acids that are located in the receptor-binding interface are known in 
the art. Preferably, this is done through visualisation of the structure of the ligand protein, ideally in 
complexed form with receptor, using one or other of the many systems available. Such systems are 

30 generally designed to import data describing a protein structure (such as a structure from the 
Protein Data Bank, the PDB) and convert this to a three-dimensional image. Images of protein 
structure allow intimate analysis of the structure of the protein to evaluate the positions of each 
residue in the protein structure, and an evaluation of which residues participate in interactions with 
other moieties, such as a receptor or monomer partner. Selected side chains are those in the protein 

35 ligand that are physically close enough to be potentially interacting with receptor. 



20 

If no protein structure is available, it is likely that there will be structural mformation available for 
one or more homologous ligands and receptors, which allows the complexes to be built by 
Homology Modelling. This is particularly true in those cases m which the sequence homology is 
higher then 40% and insertions or deletions are not found in the bindmg region of ligand and 
5 receptor. For example, in the specific case of alteration of the selectivity of the TRAIL protem for 
DR5 (TRAIL-R2) or towards DR4 (TRAIL-Rl), a crystal model of TRAIL in complex with the 
ectodomain of the DR5 (TRAIL-R2) receptor is available (PDB identifier 1DU3). Models of 
TRAIL complexed with the three other membrane receptors (DR4 (TRAIL-Rl), DcRl (TRAIL- 
R3) and DcR2 (TRAIL-R4)) may be obtained using the "What If Homology Modeling web 
10 mterfece" (Vriend WHAT IF: A molecular modeling and drug design program. J. Mol. Graph. 
(1990) 8, J2-5d) (available at http://www.cinbi.lcun.nlW servers/WIWWWF). Pdb files of TRAIL 
in complex with these three receptors can be generated by imposing their backbone atoms over the 
same atoms of the receptor DR5 (TRAIL-R2), using a program such as Swiss-PdbViewer v3.7b2 
(Guex & Peitsch. SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative 
1 5 protein modeling Electrophoresis 18, 2714-2723 (1997)-). Finally, template receptor DR5 (TRAIL- 
R2) is removed from the generated PDB file. 

In a similar manner to the method of protein stabilisation described above, it is considered 
advantageous to use alignment information in order to focus the design on residues that do not 
interact with conserved residue positions in the target receptor. On occasion, however, altering 

20 conserved residues may also lead to changes in selectivity for the receptor. By conserved residue 
positions, is meant residues that are conserved between the different receptors that bmd to the 
protein of interest. For example, the TNF ligand family TRAIL binds to four different membrane 
receptors (DR4 (TRAIL-Rl), DR5 (TRAIL-R2), DcRl (TRAIL-R3) and DcR2 CrRAIL-R4)). 
Residues in the receptor binding interface that are conserved between die different receptors are 

25 likely to contribute to the binding of the ligand protein, meaning that their altemtion would very 
likely disrupt important ligand-receptor interactions that are necessary for effective ligand-receptor 
binding. For the same reason, any residues that are predicted to interact with tiie receptor backbone 
are not considered suitable candidates for mutation. 

As described above, identification of non-conserved residues can be done using any one of a 
30 number of systems known to the person of skill in the art, but preferably, computer-unplemented 
alignment algorithms such as PSI-BLAST or ClustalW are preferred. 

The combined approach of the method therefore employs family alignment information and a 
computational design algorithm. This reduces the sequence space search for every position in the 
protein being studied and decreases the computing time and power necessary for the methodology. 
35 This methodology allows rational design of the occurring interactions between the protein ligand 




and its receptors. An overall visual inspection of the binding interface of ligand with the differenr 
receptors should also be carried out and, if necessary, some rotamers changed. 

The method requires that each of one or more residues in the ligand protein are substituted for 
replacement residues that include amino acid side-chain conformations that are predicted to fit into 
5 the binding interface with the target receptor so as to provide an increase in binding affinity for that 
receptor. This step is preferably performed using a computer design algorithm such as PERLA, 
which performs inverse folding. Briefly, this algorithm decorates a fixed backbone structure with 
amino acid side chains from a rotamer library. PERLA thus performs a rotamer search looking for 
better side chain conformations, aiming to model the expected interactions of the protein ligand 

1 0 with its receptors. Relaxation of strain in the protein structure is achieved via the generation of 
subrotamers. Most terms of the scoring function are balanced with respect to a reference state, to 
simulate the denatured protein. The side chain conformers are all weighted using the mean-field 
theory and finally candidate sequences with modelled structures (PDB coordinates) are produced. 
Energy evaluation of the modelled structure must also be performed as part of this methodology, 

1 5 preferably using a program such as FOLD-X^ or an improved version (such as that available at 
http://fold-x.embl-heidelberg.de). The force field module of FOLD-X evaluates the properties of 
the structure, such as its atomic contact map, the accessibility of its atoms and residues, the 
backbone dihedral angles, in addition to the H-bond network and electrostatic network of the 
protein. The contribution of water molecules making two or more H-bonds with the protein is also 

20 taken into account. FOLD-X then proceeds to calculate all force field components: polar and 
hydrophobic solvation energies, van der Waals' interactions, van der Waals' clashes, H-bond 
energies, electrostatics, and backbone and side chain entropies. 

Using this program, all possible amino acid substitutions (preferably with the exception of Glycine, 
Proline, and Cysteine) are then introduced at the selected residue positions in the protein ligand in 

25 conformations (side chain rotamers) that are compatible with the rest of the structure. Glycine, 
Proline, and Cysteine are preferably omitted because Gly and Pro can influence the backbone 
conformation relatively more than other amino acids (Gly is more flexible, Pro less so). Also these 
residues have relatively large effects in the denatured state (Gly high entropy. Pro lower). Cys can 
also be difficult and is in an unpaired state that is generally unwanted in proteins, making them 

30 prone to aggregation and the like. Favourable mutations are then evaluated in terms of free energy 
(kcal mol-1), and unfavourable mutations (e.g. high Van der Waals' clashes) eliminated. An output 
of sequences and coordinates is then obtained and ranked in terms of free energy, for example, 
using the FOLD-X program. Some of these predictions can be discarded directly after theoretical 
energy calculations, without further experimental analysis. Others are progressed to mutagenesis 

35 studies. In this way, the method of this aspect of the invention substitutes one or more residues in 
the ligand protein substituted for replacement residues that include amino acid side-chain 
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conformations that are predicted to fit into the binding interface with the target receptor so as to 
provide an increase in binding affinily for that target receptor. 

Favourable mutations may of course be combined and evaluated in terms of free energy (kcal mol- 
1). Unfevourable combinations (e.g. high Van der Waals' clashes) should be eliminated. 
5 If necessaiy, outputs of sequences and co-ordinates may subsequent to the design process be 
remtroduced in the design algorithm for a further round or rounds of design. 2nd, 3'^ 4*, 5' or 
more rounds of design may be used. 

Preferably, the method of this aspect of the invention may be applied to multimeric p sheet 
cytokines. Examples of such cytokines are presented in Table 6. Other examples will be known to 
1 0 those of skill in the art. 

Of particular interest for the application of the methodology of the invention are the P sheet 
multimeric cytokines that are members of the tumor necrosis factor ligand family. Ligands 
belonging to this family are involved in a wide range of biological activities, ranging from cell 
proliferation to apoptosis, and they share similar structural characteristics. 
15 Accordmg to a further aspect of the invention, there is provided a p sheet multimeric cytokine 
whose sequence has been altered so as to alter its afFmity for a particular target receptor. 
Accordingly, one aspect of the invention relates to a cytokine that is mutated at one, two, three, 
four, five, sbc seven or all eight of these positions. Preferably, such a cytokine is a member of the 
TNF ligand family. More preferably, such a cytokine is TRAIL. 
20 Examples of suitable residues for mutation in TRAIL are those at the following positions: 
131, 269, 130, 160, 218, 220, 149 and 155. 

Equivalent residues (when crystal structural information is compared by eye or with use of 
molecular modelling) identified by those skilled in Hie art could be mutated in other p sheet 
multimeric cytokines, preference being for the cytokines of the TNF family. 

25 Equivalent substitutions at the above positions, with a specific type of residue are preferred. For 
example, a preferred mutation at position 131 is to Arg. A preferred mutation at position 269 is to 
His. A preferred mutation at position 130 is to Glu. Preferred mutations at position 160 are Lys and 
Met. Preferred mutations at position 218 are to Arg, Tyr or Phe. Preferred mutations at position 220 
are to Met or to His. Preferred mutations at position 149 are to Asp or to His. A preferred mutation 

30 at position 1 5 5 is to Met. 

In the TRAIL protein, preferred mutations are G131R, D269H, R130E, G160K, D218R, G160M, 
D218Y, I220M, I220H, R149D, R149H, D218F and E155M. Particularly preferred TRAIL mutants 
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are G160M, D269H, D218Y, R130E and I220M. In particular, the G160M and D269H mutants Jl 
prefeired as these are shown herein to have superior selectivity for the death receptor 5. 

Some critical residues identified in TRAIL for receptor binding without distinction between DR4 
and DR5 (TRAIL-R2) have been identified as He 220, Arg 149, Glu 155, Gly 160 and Asp 218. 
5 The I220H and I22M mutants both highly increase DR4 binding, and have no effect on DR5 
binding. The R149D mutant decreases binding to both the DR4 and DR5 receptors, whilst the 
R149H and D218R mutants have no effect on binding to either of these receptors. The El 55 mutant 
highly decreases binding to both the DR4 and DR5 receptors; the D218F and D218Y mutants also 
decrease affinity, although to a lesser extent. 

1 0 Also, it has been shown that different amino acid substitutions in the same position can have a 
different effect on receptor binding. Some positions, critical for receptor selectivity, due to a 
different decrease in receptor binding affinity between DR4 (TRAIL-Rl) and DR5 (TRAIL-R2) 
have been identified as Arg 130, Gly 131 and Asp 269. The R130E and G131R mutants decrease 
binding to DR4 but only slightly decrease binding to DR5. In contrast, the D269H mutant highly 

1 5 decreases binding to DR4, but has no observable effect on binding to DR5. 

The skilled reader will be able to identify equivalent positions in other p sheet multimeric cytokine 
proteins, for example, using multiple alignment programs or manual inspection of related 
sequences or structures. 

Preferably, affinity for receptor is improved for DR4 (TRABL-Rl) or DR5 (TRAIL-R2) over the 
20 affinity for the decoy receptors DcRl (TRAIL-R3) and DcR2 (TRAIL-R4). More preferably still, 
affinity for DR5 (TRAIL-R2) receptor is improved over affinity for DR4 (TRAIL-Rl). 

The above mutations may be introduced in the full length cytokine sequence. Preferably, however, 
the above mutations are introduced into soluble forms of P sheet multimeric cytokines. For the 
TRAIL protein, a preferred soluble template into which these mutations may be introduced 

25 comprises amino acids 114-281. Equivalent soluble forms of other cytokines will be clear to the 
skilled reader. However, as the skilled reader will appreciate, variations in this template will very 
likely retain the properties of this soluble form and show biological activity if additional residues C 
terminal and/or N terminal of these boundaries in the polypeptide sequence are included. For 
example, an additional 1, 2, 3, 4, 5, 10, 20 or even 30 or more amino acid residues from the wild- 

30 type cytokine sequence, or from a homologous sequence, may be included at either or both the C 
terminal and/or N terminal of these boundaries, without prejudicing the ability of the polypeptide 
fragment to fold correctly and exhibit biological activity. Similarly, truncated variants of this 
template in which one or a few amino acid residues (for example, 1, 2, 3, 4, 5, 10 or more) may be 
deleted at either or both the C terminus or the N terminus without prejudicing biological activity. 
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The methods described above have been applied to a protolypic example, TRAIL, for purposes of 
illustration. It will be appreciated that this example is intended as illustrative and is not limiting in 
any way. Novel mutants of TRAIL have been designed in order to increase binding to the receptor 
DR4 (TRAIL-Rl) relative to the binding to the receptor DR5 (TRAIL-R2). This 
5 selectivity/specificity would allow lower concentrations of the variants to be administered, for 
example in the case of use as a therapeutic, than would be used with respect to wild-lype TRAIL. 
The skilled reader will appreciate that the methods described herein, for the stabiUsation of a 
multimeric P sheet cytokine molecule, and for the alteration of selectivity of a multimeric p sheet 
cytokine molecule, may be combined. 
10 m particular, combinations between the stability variants described above can be made, giving 
variants with enhanced stability and altered selectivity/specificity. Such combinations of stable and 
receptor selective/specific cytokine molecules are included as specific aspects of the present 
invention. Examples of such molecules include any one of the TRAIL Ml, M2, M3 and CI 
mutants, combined with one or more of the D269H, G160M, D218Y, R130E and I220M 
15 selectivity/specificity mutants, which give variants with enhanced stability and altered 
selectivity/specificity. 

Conventional experimental methodologies may be used to supplement flie largely computational 
approach that is outlined above. For example, once ammo acid substitutions have been identified, 
these substitutions should be produced experimentally using site-directed mutagenesis and then 
20 tested for stability or selectivity/specificity and for retention of biological activity. Identified 
molecules can be further mutated using conventional techniques of molecular evolution, to develop 
more specific therapeutic lead molecules. In the case of molecular evolution, one particularly 
useful technology is phage display, which when coupled with an effective biopamiing strategy, and 
when integrated with rational design methodologies, should result in rapid molecular evolution of 
25 novel therapeutics. 

The invention also provides purified nucleic acid molecules which encode a mutant P sheet 
cytokine as described above. Further aspects of the invention mclude vectors, such as expression 
vectors, that contain such nucleic acid molecules, along with host cells transformed with these 

vectors. 

30 In an fiirther aspect, the invention provides a p sheet multimeric cytokine whose sequence has been 
altered by a method according to any one of the aspects of the invention described above, or a 
nucleic acid molecule encoding such a molecule, or a vector containing a nucleic acid molecule as 
described, for use in the therapy or diagnosis of a disease in which cytokines are implicated. This 
aspect of the invention includes a method for the treatment of such a disease, comprising 

35 administering to a patient, a cytokine, nucleic acid or vector as described above, in a 
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therapeutically-acceptable amount. Such molecules may also be used in diagnosis, for example, b 
assessing the level of expression or activity of a gene or protein (such as an over-expressed 
receptor) in tissue from said patient and comparing said level of expression or activity to a control 
level, wherein a level that is different to said control level is indicative of disease. 

A further aspect of the invention may comprise a pharmaceutical composition comprising a mutant 
cytokine, nucleic acid or vector as described above, in conjunction with a pharmaceutically- 
acceptable carrier. 

A still further aspect of the invention may comprise transgenic or knockout non-human animals 
that have been transformed to express a mutant cytokine as described above. Such transgenic 
animals provide useful models for the study of disease and may also be used in screening regimes 
for the identification of compounds that are effective in the treatment or diagnosis of disease. 

Importantly, the generation of mutant cytokine molecules such as those described above allows for 
the design of screening methods capable of identifying compounds that are effective in the 
treatment and/or diagnosis of diseases in which these cytokines are implicated. The interacting 
domains of these molecules have an intrinsic propensity to initiate signalling pathways associated 
with the modulation of cancer and other chronic diseases, including autoimmune disease, and are 
starting points for drug design. Such screening methods are included as aspects of the present 
invention. 

Various aspects and embodiments of the present invention will now be described in more detail by 
way of example. It will be appreciated that modification of detail may be made without departing 
from the scope of the invention. 

BRIEF DESCRIPTION OF DRAWINGS 

Figure 1. A) Side view of the TRAIL trimeric complex, showing the three monomers in red, blue 
and green. B) Top view of the same complex but viewed along the longitudinal axis, depicting the 
different sets used for design. Structure figures were generated using MOLMOL (Koradi et aL, J, 
Mol Graph. 14, 51-32 (1996)). 

Figure 2. Binding of wild-type TRAIL (•), M1(b) and M2 (o) to DR5 (TRAIL-R2) (♦ •) and DR4 
(TRAIL-Rl) (— ) receptors. 

Figure 3. Thermal denaturation profiles of wild-type TRAIL (•), M1(b), Ml (o), M3 (□) and CI 
(A). 

Figure 4. Stability of wild-type TRAIL (•), Ml(«), M2 (o), M3 (□) and CI (A) at 73 °C for 60 
min. 
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Figure 5. Remaining biological activity of wild-type TRAIL. Ml, M3 and CI (from left to right) 
upon incubation at 73 ° C during 60 min. 

Figure 6. A) Comparison, between wild-type TRAIL and Ml, of the local environment around 

residues 194 and 196. B) Comparison between wild-type TRAIL and M2. Backbones of the two 
5 adjacent monomers are in green and blue, respectively, and the backbone of the DR5 (TRAE.-R2) 

receptor is in grey. Hydrogen bond interactions are depicted in dashed green lines. 

Figure 7. Receptor bindmg as a fraction of DR5 (TRAIL-R2) binding. 

Figure 8. Receptor binding as a fraction of DR4 (TRAIL-Rl) binding. 

Figure 9. Cytotoxic potential of D269H - Annexin V staining. 
10 Figure 10. Testing TRAILreceptorspecifickillingofD269Hmutant-AmiexinV staining. 

Figure 1 1 . DR5 (TRAIL-R2) selective killing by D269H - Annexin V staining. 

Figure 12. D269H and TRAIL have comparable cytotoxic potential. A, cells treated with TRAIL 

and D269H for 2h; B cells treated for 3h (B). The graph shows a representative of 2 independent 

experiments. 

1 5 Figure 13. Neutralisation of DR5, but not DR4 was able to block cell death induced by D269H. 
EXAMPLES 

Example 1: Stable TRAIL proteins 
Methods 

All reagents were of analytical grade unless specified otherwise. Isopropyl-P-D-l-thiogalactoside 
20 (IPTG), ampiciUin and diihiotreitol (DTT) were from Duchefa. Chromatographic columns and 
media were from Amersham Biosciences. Restriction enzymes used were purchased from New 
England Biolabs. 

All other chemicals were from Sigma. 
Computational design of mutants 

25 A detailed description of the protein design algorithm. PERLA, is available elsewhere (Lacroix,E. 
Protem design: a computer based approach, Ph.D.thesis. (U. Libre de Bruxelles,1999)) 
(ht.p://ProteinDesign£MBL-Heidelberg.DE) and its use has been previously described (see, for 
example. Ventura et aL, Nat. Struct. Biol 9. 485-493 (2002). In the case of oligomeric proteins like 
T^, protein design with PERLA requires Ihe foUowing steps: Firstly, residues of a monomer 

30 that could establish specific interactions with the contiguous monomer must be identified and 
selected. Secondly, side chains that contact the residues to be mutated, must be identified to allow 




side chain movements that are necessary to accommodate the new residues introduced by m§ 
algorithm. PERLA automatically selects these residues based on a geometrical approach that takes 
Ca-Ca distances and the angle between Ca-Cp vectors into consideration. Thirdly, the algorithm 
places the amino acid repertoire at each position selected from a set of naturally occurring amino 
5 acids in a multiple sequence alignment of the TNF ligand family, and eliminates those side chain 
conformations and amino acids that are not compatible with the rest of the structure. Fourthly, all 
possible pair-wise interactions are explored to eliminate those combmations that are less 
favourable. Finally, an output of sequences and PDB coordinates corresponding to the best 
calculated solution (in terms of energy) is produced. The resultant PDB files containing the 
10 mutations were energy minimized using GROMOS 43B1 as implemented in Swiss-Pdb Viewer 
v3.7b2 (Guex, & Peitsch. Electrophoresis 18, 2714-2723 (1997))), and evaluated by FOLD-X 
(http://fold-x.embl-heidelberg.de). The final energies of the models are compared to the reference, 
wild-type TRAIL structure and expressed as AAG (kcal mol"^). 

Cloning and PCR 

15 cDNA corresponding to human soluble TRAIL (aa 114-281) was cloned in pETlSB (Novagen) 
using Ncol and BamHL restriction sites. The N-terminal sequence encoding a His-tag and protease 
recognition site was therefore removed. Mutants were constructed by PCR using the Quick Change 
Method (Stratagene) or a modified megaprimer method (Picard et aL, Nucleic Acids Res. 22, 2587- 
2591 (1994)). The polymerase used was PJU Turbo supplied by Stratagene. Purified mutagenic 

20 oligonucleotides were obtained from Invitrogen. Introduction of mutations was confirmed by DNA 
sequencing. 

Expression and purification of wild-type TRAIL and mutants 

The wild-type TRAIL and TRAIL mutant constructs were transformed to Escherichia Coli BL21 
(DE3) (Invitrogen). Wild-type TRAIL and Ml were grown at a 5 1 batch scale in a 7.5 1 fermentor 

25 (Applicon) using 4 x LB medium, 1 % (w/v) glucose, 100 |Lig/ml ampicillin and additional trace 
elements. The culture was grown to mid-log phase at 37 ""C, 30 % oxygen saturation and 
subsequently induced with 1 mM IPTG. ZnS04 was added at a concentration of 100 \xM to 
promote trimer formation. Temperature was lowered to 28 °C and the culture was grown until 
stationary phase. Other mutants were grown in shake flasks at a 1 1 scale at 250 rpm, using a 

30 similar protocol. Protein expression was induced when the culture reached ODeoo 0.5 and induction 
was continued for 5 h. In this case, the medium used was 2 x LB without additional trace elements. 

The isolated pellet was resuspended in 3 volumes extraction buffer (PBS pH 8, 10% (v/v) glycerol, 
7 mM (3-mercapto-ethanol). Cells were disrupted using sonication and extracts were clarified by 
centrifugation at 40,000 g. Subsequently, the supernatant was loaded on a nickel charged IMAC 
35 Sepharose fast-flow column and wild-type TRAIL and TRAIL mutants were purified as described 
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by Hymowitz (Hymowitz et al. Biochemistry 39. 633-640 (2000)) with tiie following 
modifications: 10 % (v/v) glycerol and a minimal concentration of 100 mM NaCl were used m all 
buffers. This prevented aggregation during purification. After the IMAC fi-actionation step, 20 pM 
ZnS04 and 5 mM of DTT (instead of (S-mercapto-ethanol) was added in all buffers. Finally, a 
gelfiltration step, using a Hiload Superdex 75 column, was included. Purified proteins were more 
than 98 % pure as determined using a colloidal coomasie brilliant blue stained SDS-PAGE gel. 
Purified protem solutions were flash fi-ozen in liquid nitrogen and stored at -80 °C. 

CD Spectroscopy 

CD spectra were recoxxied on a Jasco J-715 CD spectrophotometer (Jasco Inc.) equipped with a 
PFD350S Peltier temperature control unit (Jasco Inc.). Rectangular quartz cuvettes with a 
paAlengtti of 0.2 cm were used. Protein samples were dialyzed against PBS pH 7.3 and adjusted to 
a final concentration of 100 ng/ml. Wavelength spectra were recorded between 250-205 nm using a 
0.2 mn stepsize and 1 nm band-width at 25 "C. Temperature scans fi-om 25-98 °C were performed 
at 222 mn with a scan rate of 40 °C/h. Thermal decay measurements were performed at 73 °C for 1 
h at 222 nm. 

Bioactivity of TRAIL mutants in vitro 

Bioactivily of wild-type TRAIL and TRAIL mutants was determined using a viability assay 
according to *e manufacturer's instructions (Celltiter Aqueous One, Promega). Colo205 human 
colon carcinoma cells (ATCC number CCL-222) were cultured in RPMI 1640 Glutamax 
containing 10 % heat inactivated fetal calf serum and 100 units/ml PeniciUin-Streptomycm. All 
reagents were supplied by Invitrogen. A concentration series of the wild-type TRAIL or TRAIL 
mutants was made in cell culture medium. 50 ^ of each dilution was added to a 96-well t^sue 
culture micro plate (Greiner) and 100 ^1 of cell suspension was added, to a final cell number of 
1x10* cells/well. Mixtures were incubated for 16 h at 37 °C under a humidified atmosphere 
25 containing 5 »/o CO.. Subsequently, 20 ^ of MTS reagent was added. Cell viability was determined 
after 30 min mcubation by measurmg the absorption at 490 nm. 
Receptor binding 

Binding experiments were performed using a surface plasmon resonance-based biosensor Biacore 
3000 (Biacore AB, Uppsala, Sweden), at 25 °C. Recombinant receptors were ordered fi-om R&D 
30 systems (R&D systems, Minneapolis, MN. USA). Immobilization of the receptors on the sensor 
surface of a Biacore CMS sensor chip was performed following a standard amine couplmg 
procedure according to the manufacturer's instructions. A reference surface was generated 
simultaneously under the same conditions but without receptor injection and used as a blank tx, 
correct for instrument and buffer artifacts. Purified wild-type TOAIL and TRAIL mutants were 
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injected in two-fold at a concentration of 2 fxg/ml and at a flow rate of 20 jul/min flow rate. Bindi^^ 
of ligands to the receptors was monitored in real-time. The receptor/sensor surface was regenerated 
using 3 M sodium acetate pH 5.2 injections. 

Computer screening 

5 Novel mutants of TRAIL have been designed in order to increase the stability of the bioactive 
trimer. Predictions were based on the automated computer algorithm, PERLA, as described above. 
Briefly, the program performs strict inverse folding: a fixed backbone structure is decorated with 
ammo acid side chains from a rotamer library. Relaxation of strain in the protein structure is 
achieved via the generation of subrotamers. Most terms of the scoring function are balanced with 

10 respect to a reference state, to simulate the denatured protein. The side chain conformers are all 
weighted using the mean-field theory and finally candidate sequences with modelled structures 
(PDB coordinates) are produced. Energy evaluation of the modelled structures was carried-out by a 
modified version (Schymkowitz, J., Borg, J., Rousseau, F. & Serrano, L, "manuscript in 
preparation") of FOLD-X, available at (http://fold-x.embl-heidelberg.de). The force field module of 

1 5 FOLD-X evaluates the properties of the structure, such as its atomic contact map, the accessibility 
of its atoms and residues, the backbone dihedral angles, m addition to the H-bond network and 
electrostatic network of the protein. The contribution of water molecules making two or more H- 
bonds with the protein is also taken into account. FOLD-X then proceeds to calculate all force field 
components: polar and hydrophobic solvation energies, van der Waals' interactions, van der Waals' 
20 clashes', H-bond energies, electrostatics, and backbone and side chain entropies. 

Selection of the template sequence 

The template selected was 1DU3 (Cha et ah, J, Biol Chem, 275, 3 1171-3 1177 (2000)). The crystal 
structure at 2.2 A resolution contains the trimeric structure of human TRAIL in complex with the 
ectodomain of the DR5 (TRAIL-R2) receptor. The TRAIL monomer lacks an external, flexible 
25 loop (130-146), not involved in receptor binding or in monomer-monomer interaction. To complete 
the molecule, this loop was modelled using the structure of 1D4V (2.2 A) (Mongkolsapaya et ah, 
Nat Struct Biol 6, 1048-1053 (1999)), a monomeric TRAIL in complex with DR5 (TRAIL-R2) 
receptor, having the atomic coordinates of the loop. Finally, the TRAIL molecule was isolated by 
removing the receptor molecules Jfrom the PDB file. 

30 Computational design of mutants 

The visual inspection of the isolated monomers, monomer-monomer interface and central core of 
TRAIL showed several residues as potential candidates for mutagenesis. The highly conserved 
hydrophobic residues were discarded from this list, as well as residues involved in receptor 
binding. These residues could not be mutated without disrupting interactions with the receptor. One 
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TRAIL variant (M2), however, that showed a significant predicted increase in stability but also 
contained residues involved in receptor interaction, was retained for subsequent experimental 
analysis. 

The sequence space search for every position was simplified by checking the naturally occurring 

5 amino acids in a multiple sequence alignment of proteins belonging to the TNF ligand family, thus 
decreasing the computing time, and subsequently focusing on non-conserved residues. The use of 
protein rational design and force field algorithms (PERLA, FOLD-X) allowed the identification of 
a list of mutant sequences with potential relevance for TRAIL stabiUly. Four sets of residues were 
selected for design (Figure lb and Table 1): (1) non-conserved residues at the surface of the 

1 0 monomer ('monomer' set), (2) non-conserved residues near positions close to the interface between 
two monomers ('dimer' set), (3) non-conserved residues along the central trimeric axis ('trimer' 
set) and (4) a miscellaneous set ('misc. set'). The automated computer algorithm PERLA was 
applied as previously described (Angrand et al., Biomol Eng 18, 125-134 (2001)). Amino acid 
substitutions were introduced at the non conserved residue positions in conformations (side chain 

15 rotamers) compatible with the rest of the structure. Subsequently, favourable mutations were 
combined and evaluated in terms of free energy (kcal mol"*), and unfavourable combinations (e.g. 
high Van der Waals' clashes) were eliminated. An output of sequences and coordinates was 
produced and ranked in terms of free energy using FOLD-X and subsequently reintroduced in the 
design algorithm for a T\ 2>'^ or 4* round of design, if necessary. Table 1 summarizes the list of 

20 mutants assayed in silico for increased stability of TRAIL. Some of these predictions were 
discarded dkectly after theoretical energy calculations, without further experimental analysis. 

Results 

Description of the tested mutations 

Predicted mutants were energy minimized and subsequently analyzed with FOLD-X. The energy 
25 values obtained were compared to that of the wild-type structure and used for discrimination of 
candidates. Mutants were selected based on an improvement in fi«e energy relative to wild-type 
TRAIL (Table 2). In the monomeric set, Ml (E194I, I196S) was selected because of the large 
unprovement of energy compared to wild-type TRAIL (AAG = -9.7 kcal mol"' monomer*). This 
high energy value is due to the feet that a trimer is being studied, in addition to the presence of 
30 significant van der Waals' clashes in the crystal structure (~5 kcal mor' monomer *), which are 
removed upon mutation. The mutations are located in the external loop connecting the C and D 
anti-parallel beta strands (CD loop), following the notation accordmg to Eck (Eck et al., J. Biol 
Chem. 267, 2119-2122 (1992)). The predicted increase in stability of Ml can be explained since 
Glu 194 is surrounded by hydrophobic groups (Trp 231, Phe 192, Ala 235) and Ihe carboxyl group 
35 is uncompensated. The mutation Glu 194 to He rectifies this situation by replacing the charged 




residue for a medium sized hydrophobic residue. Conversely, He 196 is surrounded by pol^ 
residues (Asn 202, Lys 233) and is very close to the backbone, resulting in probable van der Waals' 
clashes. Mutation to Ser avoids clashes and allows formation of a hydrogen bond to Asn 202, 
located in the opposite part of the CD loop (Figure 6a). Both mutations improve polar solvation 
5 energy, in addition to ameliorating side chain and backbone entropy. 

In the dimeric set (Table 2), the design of M2 (D203I, Q205M, Y237F) leads to the creation of a 
hydrophobic cluster to stabilize the interaction between residues 203 and 205 (D strand) of one 
monomer, and residue 237 (F strand) of the adjacent monomer. Gin 205 and Tyr 237 together form 
an intermolecular hydrogen bond, and Asp 203 points to a gap in the monomer-monomer interface. 

10 Mutation to He (203), Met (205) and Phe (237) breaks the Q205-Y237 hydrogen bond, but 
facilitates the tight packing of these residues, improving van der Waals' interactions, hydrophobic 
and polar solvation energies of the entire TRAIL molecule, without a further increase of van der 
Waals' clashes (Figure 6b). Although FOLD-X predicted that the affinity of M2 for the DR5 
(TRAIL-R2) receptor is lower (AAGbinding=7.3 kcal mol"^ monomer ^) than for wild-type TRAIL, 

1 5 this mutant was retained as a control to evaluate the accuracy of the procedure. 

Residue 225 of M3 (S225A), belonging to the 'Miscellaneous set% is located in strand E and is 
solvent exposed in the monomeric form. However, after trimerization, this position becomes buried 
in a small pocket, leaving the side chain of the hydrogen bond donor Ser uncompensated. After 
mutation to Ala, the energy of the model is better than wild-type TRAIL for both polar and 
20 hydrophobic solvation energies, in addition to side chain entropy. 

The Arg 227 residues of the trimeric set mutant (M4) are located in strand E, equidistantly opposed 
in a central position along the longitudinal axis of the TRAIL trimer. The three arginines are 
surrounded by hydrophobic (Ile242), polar (Ser241, Ser225) and aromatic (Tyr 240, Tyr 243) 
residues. These tyrosines direct the hydroxyl groups away from Arg 227, thus creating a rather 
25 hydrophobic cavity. The high concentration of positive charges is apparently not well compensated, 
since it forms only hydrogen bonds with the backbone (carbonyl groups of Ser241). Thus, the 
mutation of these positions to Met could help to accommodate the hydrophobic environment, as 
well as to decrease the repulsion of monomers due to uncompensated positive charges. 

Mutagenesis and Purification of Mutants 

30 The highest ranking mutants Ml and M2 together with M3 and M4, were selected for further 
experimental analysis (Table 2). A mutant (CI) combining the mutations of Ml and M3 was also 
constructed. All the designed TRAIL mutants were expressed in E. coli and purified successfully 
with a protein yield of 0.7-2 mg/1. Far-UV CD wavelength spectra indicated that all mutants were 
properly folded with characteristics of a p-sheet containing protein, similar to that of wild-type 

35 TRAIL. Gel-filtration and dynamic light scattering measurements showed that all mutant protein 
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solutions contained a single molecule species, consistent with a trimeric oligomerization state. 
Analytical ultracentrifugation with wild-type TRAIL and Ml corroborated this finding (data not 

shown). 

In vitro bioactivity and binding of designed mutants 
5 Bioactivity of the TRAIL mutants was assessed in vitro using the Colo205 human colon cancer cell 
line with a MTT based viability assay. A reduction in viability was measured using increasing 
concentrations of wild-lype TRAIL or TRAIL mutants relative to the control. While Ml, M3 and 
CI showed a bioactivity comparable to that of wild-type TRAIL (ED50 ~5 ng/ml), M2 exhibited 
bioactivity of nearly one order of magnitude lower (ED50 -50 ng/ml). Real-time binding of wild- 

10 type TRAIL and TRAIL mutants to tiie death receptors DR4 (TRAIL-Rl) and DR5 (TRAIL-R2) 
was assessed using surface plasmon resonance with a Biacore 3000 instrument. Sensograms of Ml, 
M3 and CI were identical to that of wild-type TRAIL. In contrast, M2 whilst showing a similar 
level of binding to both receptors, displayed an increased off-rate when compared to the wild-type 
TRAIL sensogram (Figure 2). 

1 5 Thermal tmfolding 

The thermal unfolding of wild-lype TRAIL and TRAIL mutants was monitored by measurmg 
changes in molar ellipticity at 222 nm upon heating. Figure 3 shows the heat induced changes of 
wild-type TRAIL and TRAIL mutants. TRAIL shows an onset of unfolding at approximately 70 °C 
and has a transition midpoint of 77 °C. The TRAIL mutants show however, onset of unfolding at 
20 increased temperatures and higher transition midpomts (Figure 3). For Ml the onset of unfolding 
was at approximately 76 ° C and the transition midpoint was at 85 °C. M2 showed an onset of 
unfolding at approximately 74 °C. M3 gave intermediate values between those of wild-type TRAIL 
and Ml, with an onset of unfoldmg of 73 °C and a transition midpoint of 80 "C. Mutant CI, 
representing the combined mutations of Ml and M3 showed values comparable to that of Ml. The 
25 mutant belongmg to the trimeric set (M4), however, showed an experimentally determined stability 
of approximately 3 °C less than wild-lype TRAIL, and was therefore discontinued. The initial 
increase in molar ellipticity around 76 °C for M2 is due to an overall change of the far UV 
spectrum, reflecting a loss of structural properties of flie starting material (data not shown). Upon 
cooling all protein solutions were turbid, indicating irreversible aggregation, tiierefore no 
30 thermodynamic parameters could be derived. Far and near UV wavelength CD scans at increasing 
temperatures confirmed the above findings (data not shown). 
Accelerated thermal stability study 

In order to test the stabiUty of TRAIL and TRAIL mutants over time, an accelerated thermal 
stability measurement was performed. The temperature of 73 " C was chosen to measure effects on 
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stability within a 1 h timeframe. At this temperature wild-type TRAIL starts to unfold, while th^ 
mutants are still properly folded (Figure 3). Protein solutions with the same concentration as used 
in the thermal unfolding measurements were incubated at 73 ^ C for 1 h and changes in molar 
ellipticity at 222 nm were measured (Figure 4). The ellipticity of wild-type TRAIL decreased from 
the onset, giving a half-life of approximately 13 min. The signal for the Ml, M2 and CI mutants 
remained essentially constant, indicating an increased thermal stability. M3 showed a half-life of 
approximately 24 min. These measurements, however, are not indicative of the bioactive trimeric 
structure of the TRAIL molecule, but of the secondary structure of the monomeric unit. To monitor 
a concomitant increase in biological activity at elevated temperatures of the mutants with 
unchanged biological activity (Ml, M3 and CI), protein solutions with the same concentrations as 
used in the thermal unfolding measurements were incubated at 73 ° C and samples were taken at 
regular intervals for 1 h. Samples were subsequently diluted in tissue culture medium and added to 
Colo205 cells, resulting in a final concentration of 100 ng/ml. After overnight incubation the 
viability of the cells was measured using a MTT assay. Wild-type TRAIL showed decrease in 
bioactivity after 20 min of incubation, while Ml and CI retamed frill bioactivity after incubation at 
73 ° C for 1 h (Figure 5), M3 displayed an intermediate bioactivity between wild-type TRAIL and 
the other mutants. The increases in thermal stability of the mutants as measured with CD could 
therefore be correlated with a more stable biologically active trimeric molecule. 

Discussion 

Others have previously applied computational engineering techniques to improve thermal stability 
of alpha-helical proteins or monomeric beta-sheet molecules. However, frequently, monomeric 
proteins of less than 100 amino acids were used as targets. To our knowledge, this report is the first 
example of computational redesign of a large trimeric all-p-sheet protein towards a more thermal - 
stable variant. Significantly, it shows that the principles learned from design and engineering of 
small proteins can also be applied for large multimeric protein complexes. 

The wild-type TRAIL (114-281) molecule has a relatively high thermal stability if compared to 
some members of the TNF ligand family. Human tumor necrosis factor alpha (TNF-a), for 
example, has an apparent T™ of 65 °C as measured with cu-cular dichroism (CD) (Narhi et al. 
Biochemistry 35, 11447-11453 (1996)) and the CD40L receptor binding domain has an apparent 
Tm of 60 °C as measured with differential scanning calorimetry (DSC) (Morris et aL, J, Biol Chem. 
274, 418-423 (1999)). In parallel mvestigations, we can show using CD that RANKL however, is 
more thermal stable than TRAIL, with an apparent Tm of 5 °C higher than wild-type TRAIL, 
confirming another study (Willard et al. Protein Expr. Purif. 20, 48-57 (2000)). In this study, we 
investigated the possibility of further increasing the thermal stability of TRAIL, as a model for all- 
P-sheet proteins, through the use of computational engineering. 
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We succeeded in extending the apparent thermal stability of the p-sheet protein by more than 5 °C 
by using a combined approach, employing both TNF ligand famUy alignment information and an 
automated computational design algorithm. Due to the non-reversible nature of the unfoldmg 
reaction, the apparent T„ is not a perfect indication of an increase in stability. From a functional 
5 point of view, therefore, it also makes sense to study the time taken for the protein to denature at 
high temperature and to relate this to an effect on biological activity. The accelerated thermal 
stability study showed that the increase in thermal stability of the mutants as measured with CD 
spectroscopy (Figure 4) can be correlated with the preservation of overall structural characteristics 
as highlighted by the lasting bioactivity of Ml during the experimental timeframe (Figure 5). When 
1 0 measuring the residual bioactivity of wild-type TRAIL and TRAIL mutants upon incubation at 73 
°C for 1 h, it was shown that, while wild-type TRAIL was all but thermally inactivated after -20 
min, the mutants, significantly, had an improved stability wi& respect to wild-type TRAIL (Figure 
5). Thus measurmg the stabiUty of wild-iype TRAIL and Ml at 73 °C is in this case indicative of an 
increased stabUity for Ml at more relevant temperatures, such as 37 °C or room temperature. 
15 Although not tested in this study, it has been shown that in case of certain therapeutically 
interesting proteins, improvement of thermal stability can also be indicative of an improved in vivo 
half-life (Luo et al.. Protein Sci. 11, 1218-1226 (2002); Filikov et al. Protein Sci. 11, 1452-1461 
(2002)). We are currently conducting studies to confirm this for our mutants. 

It is advantageous to use alignment information in order to focus the design on non-conserved 
20 residue positions. The reason for this is that conserved residues are usually retained in a family for 
a good reason and it is probable that any mutation will decrease protein stability (Serrano et al, J. 
Mol. Biol. 233, 305-312 (1993); Steipe et al, J. Mol. Biol 240, 188-192 (1994)). On the other 
hand, regions with high sequence variability are tolerant to mutation and it can be expected that 
variants that stabilize the protein can be found in these regions (Serrano et al., J. Mol Biol 233, 
25 305-3 12 (1993)). To accomplish our goal of redesigning a p-sheet protein, TRAIL, and to generate 
stable variants with the minimum number of mutations, the conserved residues forming the truneric 
interface were therefore largely excluded fi-om the prediction/optimization strategy. This resulted in 
an approach which focused mainly on improvement of the stability of the monomer (mtra-chain 
stabilization; monomeric set) or improving monomer-monomer contacts (inter-chain stabilization; 
30 dimeric set). See Table 1. 

Ml, M2, M3 and CI showed, in agreement with our predictions, an increase in thermal stability 
(Table 2; figure 3-5). Different basic principles were used in the Ml, M2 and M3 designs. Ml 
shows an example of intra-chain stabilization. Stabilization of the flexible CD loop at the surface of 
each TRAIL monomer results in an increased stability of the entire trimer. This loop is not directly 
35 involved in receptor binding and is disordered in un-complexed wild-type TRAIL structures (Cha 
et cd.. Immunity. 11. 253-261 (1999); Hymowitz et al. Biochemistry 39, 633-640 (2000)), but 
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becomes ordered on binding to DR5 (TRAIL-R2) (Mongkolsapaya et ah^ Nat Struct Biol 6, 104^ 
1053 (1999); Hymowitz et al, Mol Cell 4, 563-571 (1999); Cha et ah, J- Biol Chem. 275, 31171- 
31177 (2000)). M2, however, illustrates the optimization of the interactions between two adjacent 
monomers, i.e. inter-chain stabilization. M4 displays the stabilization of the trimeric molecule by 
5 removing a buried unsatisfied hydrogen bond donor. Contrary to our expectations, the combination 
mutant, CI (Ml and M3 combined) did not result in significant additive thermal stability. This 
might be due to the effects of local unfolding around residue 194 and 196, which could be more 
dominant than the effects of imfolding around residue 225. Although the predicted free energy 
change relative to wild-type TRAIL (-9.1 kcal mol"^ monomer^) is favourable for M4, the 

1 0 experimentally determined stability was approximately 3 °C less than wild-type TRAIL. This is 
probably due to the fact that, the three central arginines, in addition to the hydrogen bonds formed 
with the backbone, trap water in the central core of the trimer. Water bridges are thus formed to 
compensate the positive charges and this results in further stabilization of the trimer. The mutation 
Arg 227 Met is presumably less stable since the backbone's carbonyl groups are uncompensated 

1 5 and destabilize the trimer. 

The increase in thermal stability did not affect the biological activity of Ml, MS and CI. M2 was 
more stable than wild-type TRAIL but the formation of an electrostatic interaction between Gin 
205 and Arg 154 of the DR5 (TRAIL-R2) receptor was prevented (Figure 6b). This resulted in a 
subsequent 10-fold decrease in biological activity (50 ng/ml) when compared to wild-type TRAIL 

20 (5 ng/ml), as predicted by FOLD-X (AAGbinding=== 7.3 kcal mol"^ monomer"^). Our findings confirmed 
an earlier study showing decreased bioactivity of alanine mutants at these positions (Hymowitz et 
ah. Biochemistry 39, 633-640 (2000)). Analysis of binding to the DR4 (TRAIL-Rl) and DR5 
(TRAIL-R2) receptors, using surface plasmon resonance, shows an increased off-rate for M2, 
indicating a lower affinity for both receptors, when compared to wild-type TRAIL and Ml (Figure 

25 2). Since ligand-receptor binding sites are normally "high energy regions", the M2 mutations were 
expected to stabilize the TRAIL molecule. Thus, this could be regarded as an example of a possible 
increase in stability which is counterbalanced in evolution by loss of function. 

Frequently, other computational redesign studies limited the screening for improvement of thermal 
stability to the core of the molecule (Malakauskas & Mayo, Nat. Struct. Biol. 5, 470-475 (1998); 
30 Luo et ah. Protein Scl 11, 1218-1226 (2002); Filikov et ah. Protein Set 11, 1452-1461 (2002)). 
Here we show that computational redesign techniques can also involve inter-chain interfaces and 
surface residues of the molecule, to successfully stabilize the structure. 

Performance of PERLA/FOLD-X was successful in the case of the intra-chain (monomer) set, the 
inter-chain (dimeric) set and the miscellaneous set. The experimental data corresponding to these 
35 designs showed all variants within these sets were more stable than wild-type TRAIL. 
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Significantly, we could show that stabilization of the CD loop in a single monomer resulted in 
stabilization of the entire trimeric molecule (Figure 6a). 

Our studies have shown that computer redesign of a more thermal stable multimeric all P-sheet 
protein is achievable. Computational protein redesign is therefore a valuable addition to other 
5 protein engineering methodologies, such as directed evolution or experimental high throughput 
approaches, as a tool for the improvement of protein properties. Since the computational method 
used in our study is based on general physical principles, our findings can be further applied to 
design other TNF ligand family members with improved thermal stability. 

Example 2: TRAIL variants selective for the DR4 (TRAIL-Rl) or DR5 (TRAIL-R2) receptor 
10 Methods 

All reagents were of analytical grade unless specified otherwise. Isopropyl-P-D-l-thiogalactoside 
(IPTG), ampiciUin and dithiotreitol (DTT) were from Duchefa. Complete® protease inhibitor 
cocktail was purchased from Roche. Chromatographic columns and media were from Amersham 
Biosciences. Restriction enzymes used were purchased from New England Biolabs. All other 
1 5 chemicals were from Sigma. 

Computational design of mutants 

Computational design using the protein design algorithm, PERLA and FOLD-X has been described 
above. Similarly, the resultant PDB files containing the mutations were energy minimized using 
20 GROMOS 43B1 as implemented in Swiss-PdbViewer v3.7b2, and evaluated by FOLD-X 
(http://fold-x.embl-heidelberg.de). The final energies of interaction fi-om the designs of TRAIL 
mutants interacting with its different receptors are compared to the reference, wild-type TRAIL m 
complex with its four membrane receptors and expressed as AAG (kcal mol '). 

Computer screening 

25 Novel mutants of TRAIL have been designed in order to shift selectivity/specificity towards its 
different membrane receptors (DR4(TRAIL-R1), DR5(TRAIL-R2), DcRl (TRAIL-R3) and DcR2 
(TRAIL-R4)). These receptors are described by Pan et al. Science. 1997 Apr 4;276(5309): 111-3 
(DR4); Screaton et ah, Cuir Biol. 1997 Sep l;7(9):693-6 (DR5); Degli-Esposti et al., J Exp Med. 
1997 Oct 6;186(7):1 165-70 (DcRl (TRAIL-R3)) and Marsters et al, Curr Biol. 1997 Dec 

30 l;7(12):1003-6(DcR2(TRAIL-R4)). 

Designs were based on the automated computer algorithm, PERLA, as described above. Briefly, 
this algorithm performs strict inverse folding: a fixed backbone structure is decorated wifli amino 
acid side chains fi-om a rotamer library. Relaxation of strain in the protein structure is achieved via 
the generation of subrotamers. Most terms of the scoring flmction are balanced with respect to a 




reference state, to simulate the denatured protein. The side chain conformers are all weighted usin^ 
the mean-field theory and finally candidate sequences with modelled structures (PDB coordinates) 
are produced. Energy evaluation of the modelled structures was carried-out by a modified version 
(Schymkowitz, J., Borg, J., Rousseau, F. & Serrano, L, "manuscript in preparation") of FOLD-X, 
5 (available at http://fold-x.embl-heidelberg.de). The force field module of FOLD-X evaluates the 
properties of the structure, such as its atomic contact map, the accessibility of its atoms and 
residues, the backbone dihedral angles, in addition to the H-bond network and electrostatic network 
of the protein. The contribution of water molecules making two or more H-bonds with the protein 
is also taken into account. FOLD-X then proceeds to calculate all force field components: polar and 
10 hydrophobic solvation energies, van der Waals' interactions, van der Waals' clashes, H-bond 
energies, electrostatics, and backbone and side chain entropies. 

Selection of the template sequence 

Template was selected firom the Protein Data Bank, PDB identifier 1DU3. This is the crystal 
structure at 2.2 A resolution of the trimeric structure of human TRAIL in complex with the 
1 5 ectodomain of the DR5 (TRAIL-R2) receptor. In this structure TRAIL monomers lack an external, 
flexible loop (residues 130-146), not involved in receptor binding. To complete this template, this 
loop was modelled using the crystal structure of 1D4V (2.2 A), a monomeric TRAIL in complex 
with DR5 (TRAIL-R2) receptor, which has the atomic coordinates of this loop. 

Modeling TRAIL non-crystallized receptors 

20 Models of the three other TRAIL membrane receptors (DR4 (TRAIL-Rl), DcRl (TRAIL-R3) and 
DcR2 (TRAIL-R4)) were obtained using "What If Homology Modeling web interface" (available 
at http://www.cmbi.kun.nl/gv/servers/WIWWWI/) . Afterwards, pdb files of TRAIL in complex 
with these three receptors were generated by imposing their backbone atoms over the same atoms 
of the receptor DR5 (TRAIL-R2), using Swiss-Pdb Viewer v3.7b2. Finally, template receptor DR5 

25 (TRAIL-R2) was removed ft-om the generated PDB file. 

Modeling interactions between TRAIL and modelled receptors 

Rational design of the occurring interactions between TRAIL and the three modelled receptors was 
carried out in the following way: First, receptors' binding interface with TRAIL were studied 
looking for target amino acids for rotamer replacement. Selected side chains were the ones 
30 physically close enough to TRAIL to be potentially interacting with it. Conserved residues with 
receptor DR5 (TRAIL-R2) were discarded from this rotamer replacement. Second, PERLA 
performed a rotamer search looking for better side chains conformations, aiming to model the 
expected interactions of TRAIL with these receptors. Finally, an overall visual inspection of the 
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binding interface of TRAIL with the different receptors was carried out and some rotamers were 
changed. 

Computational design of mutants 

Only TRAIL amino acids located in the receptor-binding interface were considered as potential 
5 candidates for in silico mutagenesis. Residues interacting either with conserved amino acids among 
the four different receptors or with receptor backbone were discarded from this Ust. The use of 
protein rational design and force field algorithms (PERLA, FOLD-X) allowed the identification of 
a Ust of mutant sequences with potential relevance for TRAIL selectivity/specificity. 
The automated computer algorithm PERLA was applied as previously described. All possible 

1 0 amino acid substitutions (with the exception of Glycine, Proline, and Cysteine) were introduced at 
the previously selected residue positions in conformations (side chain rotamers) compatible with 
the rest of the structure. Subsequently, favourable mutations were evaluated in terms of free energy 
(kcal mol-1), and unfavourable mutations (e.g. high Van der Waals' clashes) were eliminated. An 
output of sequences and coordinates was obtained and ranked in terms of free energy using FOLD- 

15 X. Some of these predictions were discarded directly after theoretical energy calculations, wiliiout 
fiirther experimental analysis. 

Cloning and PCR 

cDNA corresponding to human soluble TRAIL (aa 114-281) was cloned in pETlSB (Novagen) 
using Ncol and BamHl restriction sites. The N-terminal sequence encoding a His-tag and protease 
20 recognition site was therefore removed. Mutants were constructed by PCR using the Quick Change 
Method (Stratagene) or a modified megaprimer method. The polymerase used was PJu Turbo 
supphed by Stratagene. Purified mutagenic oligonucleotides were obtained from Invitrogen. 
Introduction of mutations was confirmed by DNA sequencing. 

Screening for selectivity mutants 

25 TRAIL mutant constructs were transformed to Escherichia Coli BL21 (DE3) (Invitrogen). Mutants 
and wild-type TRAIL were grown at a 10 ml scale, using a 2x LB medium. Cultures were grown to 
ODfioo 0.5 at 37 °C and 250 rpm, and protein expression was subsequently induced with 1 mM 
IPTG. ZnS04 was added at a concentration of 100 pM to promote trimer formation. Ten^erature 
was lowered to 28 °C and induction was continued for 5 h. Cells were harvested by centrifugation. 

30 Pellet was resuspended in extraction buffer (PBS pH 7.3, 10% (v/v) glycerol and Conq>lete® 
protease inhibitor cocktail), in 25% of the original volume. Cells were disrupted using sonication 
and extracts were clarified by centrifugation at 20,000 g. TRAIL mutant protein expression was 
assessed using SDS-PAGE. Clarified extracts of mutants that were well expressed, were 
subsequently diluted 1:50 in HBS-EP buffer. These dilutions of the wild-type TRAIL and TRAIL 
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mutants were injected in two-fold, at a flow rate of 50 |al/min on a Biacore 2000. A Biacore CA^^ 
sensor chip coated with the TRAIL receptors DR4 (TRAIL-Rl), DR5 (TRAIL-R2) and DcRl 
(TRAIL-R3) and with RANK (receptor) as control surface, was used. Binding of ligands to the 
receptors was monitored in real-time. The receptor/sensor surface was regenerated using 3 M 
5 sodium acetate pH 5.2 injections. Ratios of binding for the different receptors were calculated 
relative to DR4 (TRAIL-Rl) or DR5 (TRAIL-R2) binding, respectively. Mutants having different 
binding profiles compared to the binding profiles of wild-type TRAIL were chosen for subsequent 
analysis. 

Expression and purification of wild-type TRAIL and mutants 

1 0 The chosen mutants were grown in shake flasks at a 1 1 scale at 250 rpm and 37 °C, using a 2x LB 
medium. Protein expression was induced with 1 mM BPTG when tihie culture reached ODgoo 0.5, 
ZnS04 was added at a concentration of 100 \xM, Temperature was lowered to 28 °C and induction 
was continued for 5 h. The isolated pellet was resuspended in 3 volumes extraction buffer (PBS pH 
8, 10% (v/v) glycerol, 7 mM p-mercapto-ethanol). Cells were disrupted usmg sonication and 

1 5 extracts were clarified by centrifugation at 40,000 g. Subsequently, the supernatant was loaded on a 
nickel charged IMAC Sepharose fast-flow column and wild-type TRAIL and TRAIL mutants were 
purified as described by Hymowitz (see above) with the following modifications: 10 % (v/v) 
glycerol and a minimal concentration of 100 mM NaCl were used in all buffers. This prevented 
aggregation diiring purification. After the IMAC fractionation step, 20 |liM ZnS04 and 5 mM of 

20 DTT (instead of P-mercapto-ethanol) was added in all buffers. Finally, a gelfiltration step, using a 
Hiload Superdex 75 column, was included. Purified proteins were more than 98 % pure as 
determined using a colloidal coomasie brilliant blue stained SDS-PAGE gel. Purified protein 
solutions were flash frozen in liquid nitrogen and stored at -80 °C. 

Screening for selectivity mutants 

25 Extracts of 16 expressing mutants were evaluated for binding to DR4 (TRAIL-Rl), DR5 (TRAIL- 
R2) and DcRl (TRAIL-R3) receptors using surface plasmon resonance with a Biacore 3000 
instrument. Binding to murine RANK (receptor) was monitored as control as wild-type TRAIL 
does not bind to this receptor. Control extracts of BL21 (DE3) culture without an over-expression 
plasmid and of a BL21 (DE3) culture with plasmid over-expressing SH3 domain were also 

30 injected. No binding was observed for these extracts. The ratio of DR4 (TRAIL-Rl), DcRl 
(TRAIL-R3) or RANK (receptor) binding with respect to DR5 (TRAIL-R2) receptor binding 
(Figure 7) and of DR5 (TRAIL-R2), DcRl (TRAIL-R3) or RANK (receptor) binding with respect 
to DR4 (TRAIL-Rl) receptor binding (figure 8), were calculated. The ratios values obtained for 
TRAIL selectivity mutants binding were compared to the ratios obtained for wild-type TRAIL 
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binding. Receptor binding curves were also visually inspected for alterations in on and off rates 
when compared to the on and off rates of wild-type TRAIL. 

Two mutants, D269H and G160M, with a reduced binding to the DR4 (TRAIL-Rl) and unchanged 
binding to the DR5 (TRAIL-R2) receptor ("selective" for DR5 (TRAIL-R2)) were chosen for 
5 further analysis. We have shown an increased off-rate of D269H w.r.t. DR4 but this does not imply 
that we have seen an increased binding for DR5. Our current data does imply that we have directed 
the binding of this mutant towards DR5 binding, hence it is more selective/specific for the DR5 
receptor. Affinity can of course be improved after selectivity variants are chosen. I220M and 
E195R (not shown) were also elected, as they also showed a reduced binding to the DR4 (TRAIL- 
1 0 Rl) and increased binding to the DR5 (TRAIL-R2) receptor. The effects, however, were smaller 
than that of the previous two mutants. D218Y was chosen for further analysis, as it showed a small 
preference for binding to the DR4 (TRAIL-Rl) receptor, compared to wild-type TRAIL. R130E 
was chosen as it showed a small reduction m binding to the DcRl (TRAIL-R3) receptor (less 
"selective" for DcRl (TRAIL-R3)). 
1 5 Determination of Receptor binding 

Mutants R130E, G160M, D218Y, I220M and D269H were purified. Binding of the purified 
mutants was assessed m real time using surface plasmon resonance on a Biacore 3000 with a sensor 
chip as described above. Mutants and purified wild-type TRAIL were injected in a concentration 
series of 17.3, 34.5, 69, 138 and 276 nM at a flow rate of 70 \iXlwin. Measurements performed at 37 
20 °C to allow increased koff and dissociation was monitored for 20 min. Measured k„„ and kofr rates 
were used to calculate apparent Ka values for the mutants for the respective receptors. A global fit 
procedure was used and a 1:1 Langmuir interaction model. A biphasic behavior was found for the 
^si^ and koff. The preliminaiy apparent BCd values were calculated and are shown m table 3. 

In vitro studies of the TRAIL mutants D269H 
25 Experimental conditions 

Cell line and treatment. Colo205 colon cancer cells were maintained in RPMI1640 medium, 10% 

FCS, 1% penicillin, 1% streptomycin in humified incubator, 37 °C , in 5 %C02 environment. 

TRAIL receptor inhibitors (neutralising antibodies) were always added Ih before TRAIL addition. 

The Colo205 cells were seeded the day before the experhnent at 10^ cells/ml in 24 well plates, 
30 ImlMell were treated with increasing, concentration of antiDR4 and anti DR5 neutralising 

antibodies for Ih. 20ng/ml D269H was added to the cells and incubated for 2h and 30 minutes. 

After treatment, the cells were harvested by scraping them gently off the wells and spun down. 

Armexin V staining: 



^ 41 

Control or treated Colo205 cells were harvested and collected by centrifugation, washed once' 
Annexin V incubation buffer and resuspended in 400|uil fresh incubation buffer. 1 jal Annexin V 
was added to the samples, incubated at room temperature for 10 minutes and measured 
immediately on a FACSCalibur Flow cytometer (Beckton Dickinson), results being expressed as % 
5 of annexin V positive cells. 

Caspase assay: Colo205 cells were plated in 6 well plates at 200.000 cells/ml, 3ml/well on the day 
before the experiment. R2C16 (lOOng/ml and 500ng/ml), antiDR4 (TRAIL-Rl) and antiDRS 
(TRAIL-R2) neutralizing antibodies (Alexis) (200ng/ml) were added into the culture medium Ih 
before adding TRAIL (lOng/ml). Cells were harvested for caspase activity assay after 2h treatment 
10 with TRAIL. 

Cells were pelleted, washed twice in ice cold PBS, resuspended in 50ml PBS, 2x25ml cell 
suspension was snap frozen. Caspase enzyme activity was measured using fluorescently tagged 
DEVD-MCA (for caspase-3 and -7-like activity), or lETD-MCA (for caspase-8) tetrapeptides. The 
fluorescence intensity caused by the released MCA was measured kmetically in 25 cycles with 60 
1 5 sec intervals. The enzyme activity was calculated as nmole MCA released per minute by 1 mg total 
protein. 

MTT assay: Colo205 cells were seeded in 96 well plates at 200.000 cells/ml, lOOjul/welL Each 
treatment was carried out in triplicates. After 24h treatment SOOjxg/ml MTT stain was added to the 
wells and was incubated at 3TC for 3h. The reaction was stopped and the formazan precipitate was 
20 dissolved by adding 100p,l 20% SDS in 50% dimethylformamide. 

Results 

For the analysis of the apoptosis inducing potential of D269H TRAIL sensitive Colo205 colon 
carcinoma cells were treated with increasing concentrations of TRAIL (aa 1 14-298) or D269H for 
2 and 3 hours. Annexin V labelling of the apoptotic cells was used to monitor the level of cell death 
25 induced. 

Our results (see Figures 9, 10 and 11) revealed that the two ligands have comparable death 
inducing potential in Colo205 cells. Thus, the designed mutations in D269H did not decrease its 
cytotoxic potential significantly (Figure 12). 

In order to examine which TRAIL receptor is more involved in D269H induced death, 1 hour prior 
30 to D269H treatment of Colo205 cells, increasmg amounts of neutralizing antiDR4 or antiDRS 
antibodies were administered. The presence of the antiDR4 antibody failed to prevent death 
induced by D269H. On the other hand, already the lowest concentration (200ng/ml) of antiDRS 
antibody almost completely prevented cell death (Figure 13). These results suggest that D269H 
induces cell death primarily through ligation of TRAIL receptor 2 (DRS). Similar studies on 
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different cell lines will be required to prove that this effect is not cell type specific and these are 
going. 
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Table 1 Residues initially considered for design 



Monomer Set Dimer Set Trimer Set Misc. Set 



E194^ 
1196^ 



H125 


R227^ 


A123 




0230 


A272 


Y185 


Y240 


S225^ 


01 87 




V280 


S232 




F163 


D234 




A123 


Y237^ (D203,Q205) 




V208 


L239 






S241 






£271^ 






F274 







^ Used in subsequent rounds of design 

Mutants in parenthesis were added in subsequent rounds as interaction partners 
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Table 2 Computational design results 





AAGstability 


AAGbinding 


Set 


Mutations 


Ml 


-9.7 


0.4 


Monomer 


El 941, 1196S 


M2 


-4.0 


7.3 


Dimer 


D203I, Q205M, Y237F 


M3 


-7.0 


-0.5 


Misc. 


S225A 


M4 


-9.1 


-1.2 


Trimer 


R227M 


CI 


-11.4 


-0.9 


Combination 


M1+M3 



* Energy in kcal mol'', calculated per monomer 

* AGbi„ding= AG c™^,ex-(SAG Chain); AAGbtadi„g=AGbi„djng mutant -AGbi„<B„g wild-type 



Table 3. Apparent values for DR4 (TRAIL-Rl) and DR5 (TRAIL-R2) receptors 



DR4(TRAIL-R1) DR5(TRAIL-R2) 
(nM) (nM) 



Wt TRAIL 


0.6 


0.4 


G160M 


0.4 


0.5 


D269H 


0.6 


<0.4 


D218Y 


? 


? 


others 


N.D. 


N.D. 
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Table 4: Critical residues identified for receptor binding without distinction betweai DR4 ant 
DR5. 



Mutant 


Effect on DR4 binding 


Effect on DR5 binding 


I220H 


Highly decreases 


Hisfhlv decreases 


I220M 


Highly decreases 


Highly decreases 


R149D 


Decreases 




R149H 


No observable effect 


N^fi obRftTVflblf* pffisnt 

±^V/ \J\Jai^\, VCIil^Xw vrXXwV/lf 


E155M 


Highly decreases 


Highly decreases 


G160K 


Highly decreases 


Highly decreases 


G160M 


Highly decreases 


Highly decreases 


D218F 


Decreases 


Decreases 


D218Y 


Decreases 


Decreases 


D218R 


No observable effect 


No observable effect 


Table 5: Critical residues identified for selectivity. 


Mutant 


Effect on DR4 bindins 


Effect on DR5 binding 


R130E 


Decreases 


Slightly decreases 


G131R 


Decreases 


Slightly decreases 


D269H 


Highly decreases 


No observable effect 
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Table 6: The TNF ligand-receptor family and association with autoimmune disease 





Receptors 


Ftinetioii 


Hssociattou 


A PRTf 




Probfiibly co-stitnulator of B 
aiidT cells 


RA,SLE 


4.1BBL(TNrSFI>) 


4-lBB (TNFSFR9) 


Trcell 4XJsliniulator/regulator 


EAE,RA 


BAFF {TNFSF13W20)* 


TACLBCMA^AFF-R 


B cell survival/maturatiDB^ 


SlMf SSt RA 


CD30L<TNJ'BF8) 


CD30 (TNFSFR8) 


Modulator of T cell function 


SLE, RA, SS, autoimmune 
thyroid disea'M^^ primary 
biliary cirrhosis 






B cell survival, siiniuiation 
and diffecentiatjon 


SLE,SS.1BD,EAE 


FasL(TOFSFI?> 


FasCrNFSFR6),DcR3 


Apoptosls 


SLE, HAS, diabetes, 
autoimmune thyroid disease 
and autoimmune hepatitis 


GITRL{TNFSF18> 


GlTRtTNRSFRIB) 


T-celi oQstiimilator/rejgtiiator 


unclear 


Lilixrl 1 i 1 IN 1* of* 14 j 


Lrlp-K,t JtlV tlM, UCKiS 

<TNFSFR6B) 


T cell actiVtttion and 
fhyniocyte survival 


Diabetes, possibly RA 


LTa ( FNFSFl ) hQHism 


TTjvl n I? I TWI^Uf *? 14 V KM 


IVklTtjiTW niii t i rttt 
xii t iiii.suiid ci \fn 


RA^ SLE, IBD, MS» diabetes 


LIWP 


LTP-R,HVEM, DcR3 
tTNFSFR6B) 


ThJ respon.^, fyniph node 
development, splenic 
arcliitecture aiid organization 


RA, SLE, IBD, MS, diabetes 


OX^L{TNFSF4) 


OX40(TNFSFR4) 


T-cell CQStimulator 


E/\E,RA,IBD 


RANKt (TNFSFil)* 


RANK(TNFSFRilA), 
OPG 


Dendritic cell survival^ 
osteoclast formatioa 


RA 


TNF.CTNFSEjy*^ 


TNF-Rl(TNFSm I A),TNF- 
R2 (TNFSFRIB) 


Inflammation, splenic 


RA, SJLB. IBD. MS, ctiabetes 


TRML(TlHFlSFilBJ^ 


TRAIL-Rl <DR4, 
TNFSFRIOAX TRAIL-R2 
<DR5,TNFSFR10BK 
TRAILERS (DcRU 
TNFSFRiOQ, TRAIL-R4 
(DcR3, TNFSFRIOD), OPO 
(TNFSra 1 IB) 


Induces tumour a; II death 
Blocks T cell proliferation 


Favorable in RA model 
I'ossible autoimmune 
thyroid dlsea^; Possibly 
MS. 


TWEAK 


TWEAK-R 


may contribute to 
macrophage homeostasis by 
jnediating CD4 {Hh)T-OBll 
lulling of anlt^it-presenting 
niacropliages; Induce 
prolifemtiGBi in endothelial 
cells* 


Brain inflammation: 
anglogenesis. 


TWEPRIL 


Possibly TACI, BOVIA mid 
unknown 


Requites further elucidation 


Requires tether elucidation 



MS, multipie «:lerosis; RA, rheumatoid arlltrttls; SL% systemic lupus eiy UieinatDSBs; SS, Sjttrgen\^siyndromB; EAE,experhnBitfat 
anttkiamune encephalomyelit^^ IBB* Inflaimnatoiy how&l dts&ase. 




CLAIMS 



1. A computer-implemented method for the stabilisation of a p sheet multimeric cytokine, 
comprising the step of: 

mutating a residue in a monomer component of the multimeric cytokine protein so as to 
5 improve the free energy of the monomer or of the multimeric complex relative to the 

wild-type unmutated monomer component; 

wherein said mutated residue is non-conserved between homologous members of the 
cj^okine family. 

2. A method according to claun 1, wherein the non-conserved residue that is mutated is at 
10 the surface of the monomer component of the multimeric cytokine protein. 

3. A method according to claim 1, wherein the non-conserved residue that is mutated is near 
a position close to the interface between two monomer components of Ihe cytokine 
protein. 

4. A method according to claim 1, wherein in a trimeric cytokine protein, the non-conserved 
1 5 residue that is mutated is at a position along the central trimeric axis of the multimeric 

protein. 

5. A method according to any one of claims 1-4, wherein more than one non-conserved 
residue is mutated. 

6. A method according to any one of the preceding claims, wherein non-conserved residues 
20 are identified using a computer-implemented alignment algorithm. 

7. A method according to claim 6, wherein in an alignment between the candidate for 
mutation and other members of the same protein family, a conserved residue is one that is 
shared between at least 50% of the family. 

8. A method according to any one of the preceding claims, wherein a protein design 
25 algorithm is used to facilitate the identification of candidate residues for mutation. 

9. A method according to claim 8, wherein said method performs an energy calculation 
involving the following steps: 

a) identification of residues of a monomer tihiat could establish specific interactions 
with the contiguous monomer; 
30 b) identification of side chains that contact residues that are candidates for mutation; 
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c) at each residue position is placed each amino acid in a repertoire selected from a 
set of naturally occurring amino acids in a multiple sequence alignment of members of 
the same protein family, and any side-chain conformations and amino acids that are not 
compatible with the rest of the structure are eliminated; 
5 d) all possible pair-wise interactions are explored to eliminate those combinations that 

are not favourable. 

10. A method according to claim 9, wherein said energy calculation is carried 
computationally, taking into account the properties of the structure, including its atomic 
contact map, the accessibility of its atoms and residues, the backbone dihedral angles, in 

1 0 addition to the H-bond network and electrostatic network of the protein, the contribution 

of water molecules making two or more H-bonds with the protein, polar and hydrophobic 
solvation energies, van der Waals' interactions, van der Waals' clashes, H-bond energies, 
electrostatics, and backbone and side chain entropies. 

11. A method according to claim 10, wherein the method outputs a sequence and/or PDB 
1 5 coordinates corresponding to the best calculated solution. 

12. A method according to claim 11, wherein the sequence and/or PDB co-ordinates 
including the mutations are energy-minimized and the final predicted energies are 
compared to the reference, wild-type structure in terms of AAG (kcal mol-1). 

13. A p sheet multimeric cytokine whose sequence has been altered by a method according to 
20 any one of the preceding claims, so as to be more stable than the wild-type, unaltered 

cytokine protein. 

14. A cytokine according to claim 13, which is a member of the TNF ligand family. 

15. A cytokine according to claim 14, which is TRAIL. 

16. A cytokine according to claim 15, which is mutated in the soluble C-terminal portion of 
25 the molecule. 

17. A cytokine according to claim 15 or 16, which is mutated at one or more of the following 
positions: 

a) a non-conserved residue at the surface of the monomer component of the 
multimeric cytokine; 

30 b) a non-conserved residue close to the interface between two of the monomer 

components of the multimeric cytokine; 

c) for truneric cytokines, a non-conserved residue along the central trhneric axis; 
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d) a miscellaneous residue whose mutation is energetically favourable. 

18. A cytokine according to claim 17, which is mutated in the external loop that connects that 
C and D anti-parallel beta strands (the CD loop), following the notation according to Eck 
(Eck et al., J. Biol. Chem. 267, 21 19-2122 (1992). 

5 19. A cytokine according to claim 17 part a), which is mutated at one or both positions 194 
and 196. 

20. A cytokine according to claim 19, which is a TRAIL mutant containing the mutations 
E194I and/or I196S. 

21. A cytokine according to claim 17 part b), which is mutated at one or more of the positions 
10 125, 163, 185, 187, 232, 234, 237, 203, 205, 239, 241, 271, 274. 

22. A cytokine according to claim 21, which is a TRAIL mutant containing one or more of 
the mutations D203I, Q205M and Y237F. 

23. A cytokine according to claim 17 part c), which is mutated at one or more of positions 
227, 230 and 240. 

15 24. A cytokine according to claim 23, which is a TRAIL mutant containing the mutation 
R227M. 

25. A cytokine according to claim 23, which is a TRAIL mutant containing the mutation 
C230S and Y240F. 

26. A cytokine according to claim 17 part d), which is mutated at one or more of the positions 
20 123, 272, 225, 280, 163, 123 and 208. 

27. A cytokine according to claim 25, which is a TRAIL mutant containing the mutation 
S225A. 

28. A cytokine which is mutated at more than one position as listed in claim 17, parts a) to d). 

29. A cytokine according to claim 27, which is a TRAIL mutant containing the mutations 
25 E194I, I196S and S225A. 

30. A cytokine according to any one of claims 13-28, wherein the described mutations are 
introduced into a soluble form of the cj^okine. 

31. A cj^okine according to claim 29, which is a TRAIL mutant comprising residues 114- 
281. 

30 32. A method for the alteration of the selectivity of a p sheet multimeric cytokine for a target 
receptor, the method comprising 
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a) identifying amino acids in the cytokine that are located in the receptor-binding 
interface as candidates for mutation; 

b) discarding residues interacting with amino acids that are conserved among 
receptors bound by the cytokine protein; 

5 c) discarding residues interacting with the receptor backbone; and 

d) substituting each of one or more residues in the cytokine protein for replacement 
residues that include amino acid side-chain conformations that are predicted to fit mto the 
binding interface with the target receptor so as to provide an increase in binding affinity 
and selectivity/specificity of the cytokine protein for that target receptor. 
1 0 33. A p sheet multimeric cytokine whose sequence has been altered by a method according to 
claim 3 1 so as to alter its affinity for a particular target receptor. 

34. A cytokine according to claim 32, which is mutated at one or more of the positions 131, 
269, 130, 160, 218, 220, 149 and 155 in the cytokine. 

35. A cytokine according to claim 32 or 33, which is a member of the TNF ligand family. 
15 36. A cytokine according to claim 34, which is TRAIL. 

37. A cytokine according to claim 35, which has superior selectivity for the DR5 (TRAIL- 
R2) or DR4 (TRAIL-Rl) over the decoy receptors DcRl (TRAIL-R3) and DcR2 
(TRAIL-R4). 

38. A cytokine accordmg to claun 35, which has superior selectivity for the death receptor 5 
20 (TRAIL-R2) over selectivity for the death receptor 4 (TRAIL-Rl) 

39. A c^okme acconiing to claim 37, which contains one or more of the mutations G131R, 
D269H, R130E, G160K, D218R, G160M, D218Y, I220M, I220H, R149D, R149H, 
D218FandE155M. 

40. A cytokine according to claim 38, which contains the mutations G160M or D269H. 
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